Methods and compositions for production of mycoplasmal adhesins

ABSTRACT

The molecular cloning and nucleotide sequence of the complete structural gene encoding Mycoplasma pneumoniae P1 cytadhesin and the amino acid sequence of the protein is described. The present invention provides recombinant DNA clones encoding the complete P1 protein as well as clones expressing P1 polypeptides with cytadhesin epitopes. The substantially purified nucleic acid molecules, recombinant vectors, recombinant cells, and recombinant polypeptides of the present invention are useful as hybridization probes and immunodiagnostic reagents and may be used to prepare anti-mycoplasmal vaccines.

The Government may own certain rights in this invention pursuant toNational Institute of Health, Grant Number AI 18540, awarded by theDepartment of Health & Human Sciences.

This application is a divisional of U.S. Ser. No. 07/118,967, filed Nov.10, 1987 now U.S. Pat. No. 5,026,636.

This application is related to co-pending U.S. application Ser. No.07/004/767, which is incorporated herein by reference.

BACKGROUND OF THE INVENTION

1. FIELD OF THE INVENTION

This invention relates to the molecular cloning of the gene encodingMycoplasma pneumonias P1 cytadhesin protein. This protein mediatesmycoplasmal colonization of host respiratory epithelium and is acritical virulence determinant. By the present invention, a complete DNAsequence of the complete P1 gene as well as a deduced amino acidsequence of the P1 cytadhesin protein is presented for the first time.In addition, clones expressing M. pneumonias peptides are provided.Those peptides contain the functional cytadhesin epitopes and have beenused to localize the cytadhesin binding domain of P1.

2. DESCRIPTION OF THE RELATED ART

M. pneumonias is a non-invasive pathogen that colonizes the mucosalsurface of the respiratory tract and causes a primary, atypicalpneumonia. Although this disease appears to occur most frequently inyoung adults and children, its incidence in the general population maybe underestimated because the symptoms are often relatively mild anddiagnostic procedures are suboptimal.

M. pneumonias initiates infection by colonizing cells of the respiratoryepithelium. This colonization is mediated by a specialized tip-likeorgannelle containing clusters of a surface-localized, trypsin sensitiveprotein designated P1. Numerous studies show P1 to be a criticalvirulence determinant. For example, mutants of M. pneumonlae that lackP1 or are unable to mobilize and anchor P1 at the tip are avirulent. Inaddition, treatment of virulent M. pneumonias with trypsin abrogatesadherence to the respiratory epithelium. Finally, monoclonal antibodiesto P1 have been shown to block M. pneumoniae cytadherence. Plummer, etal., Infect. Immun., 53:398-403 (1986).

Unfortunately, despite the critical importance of P1 as a mycoplasmalvirulence determinant, efforts to provide a cloned gene encoding the P1cytadhesin have been generally unsatisfactory. For example, Trevino, etal., Infect. Immun., 53:129-134 (1986), describe an attempt to clone M.pneumonias antigens by constructing an M. pneumonias genomic libraryusing lambda phage EMBL3 as the vector and immunoscreening the librarywith adsorbed anti-M. pneumonias serum. Although this procedure producedseveral clones exhibiting antigenic cross-reactivity with M. pneumoniasP1, none of the clones reacted with monoclonal antibodies specific forcritical antigenic determinants of P1 shown by the present inventors tomediate cytadherence. Moreover, the largest immunoreactive proteinidentified had a molecular weight of only 140 kDa. In contrast, nativeP1 has a molecular weight of approximately 165 kDa. Therefore, it couldnot be definitely established whether or not the 140 kDa protein was aproduct of the structural P1 gene. The approach was then abandoned.

Since the P1 cytadhesin is probably the most important mediator ofmycoplasma cytadsorption, further elucidation of the structure of thismolecule is likely to provide information essential for a completeunderstanding of the role of cytadherence in pathogenesis of mycoplasmaldisease. This goal can be achieved most readily by cloning andsequencing the structural gene encoding P1. Furthermore, recent studieshave shown that adherence of mycoplasma to respiratory epithelium can beinhibited by certain antibodies directed against cytadhesin epitopes ofP1. Therefore, vaccines comprising recombinant P1 protein or selectedcytadhesin polypeptides derived from recombinant P1 are likely to proveeffective in preventing mycoplasmal infection. In addition, theavailability of the complete gene sequence and deduced amino acidsequence for M. pneumonias P1 will allow one to map critical antigenicepitopes and produce selected synthetic peptides useful as diagnosticprobes or vaccines.

SUMMARY OF THE INVENTION

By the present invention, the cloning and DNA sequencing of the completeP1 gene is described for the first time. In addition, the complete aminosequence of the P1 protein is provided. The invention also providesrecombinant P1 polypeptides, including polypeptides expressed as fusionproteins comprising cytadhesin epitopes. Accordingly, in a general andoverall scope, the present invention comprises recombinant clonesencoding P1, recombinant DNA sequences suitable for use as hybridizationprobes to assist cloning of genes encoding P1 and other mycoplasmalcytadhesins, methods for isolating such genes, and recombinant P1polypeptides.

More particularly, the invention relates to substantially purifiednucleic acid molecules comprising a nucleotide sequence encoding the P1protein or portion of the C-terminal portion thereof. Of course,absolute purification of the nucleic acid molecule is not necessary.Rather, the term "substantially purified" is intended to distinguish theclaimed species from species found in nature. Moreover, it will beappreciated that there is no requirement that the nucleic acid encode acomplete P1 protein. All that is required is that the molecule encode atleast a portion of the C-terminal portion of the P1 protein. For thepurposes of the present invention, a C-terminal portion of P1 is definedas the portion of P1 encoded by nucleotides downstream from nucleotide2440.

In a further embodiment, the substantially purified nucleic acidmolecule encodes a P1 protein having molecular weight of about 165-170kDa. In yet still a further embodiment, the invention relates to anucleic acid molecule wherein the nucleotide sequence is defined as anucleotide sequence encoding the amino acid sequence of FIGS. 6A-6N (SEQID NO. 9 or SEQ ID NO. 10). Although the term nucleic acid is meant toinclude both ribonucleic acid (RNA) and deoxyribonucleic acid (DNA), DNAis preferred for the purposes of the present invention. Accordingly, inone embodiment, the nucleic acid is described as DNA.

In addition, the invention provides a substantially purified nucleicacid molecule comprising a nucleotide sequence encoding an M. pneumoniasP1 polypeptide having a cytadhesin epitope. For purposes of the presentinvention, a polypeptide is defined as a peptide of more than one aminoacid, and a P1 cytadhesin epitope is considered to be any P1 polypeptidewhich binds to an antibody capable of inhibiting P1 mediatedcytadherence or is itself capable of competitively inhibiting P1mediated cytadherence. For example, a more specific embodiment relatesto a nucleic acid molecule wherein the cytadhesin epitope encoded iscapable of reacting immunologically with monoclonal antibody 5B8,produced by ATCC#HB 9586.

Similarly, an additional embodiment is directed toward a nucleic acidmolecule where the cytadhesin peptide is capable of reactingimmunologically with monoclonal antibody 6E7, produced by ATCC#HB 8420.Further embodiments of the invention relate to nucleic acid moleculescomprising DNA sequences encoding M. pneumonias P1 polypeptides of atleast thirteen amino acids in length. More specifically, the inventionprovides for nucleic acid molecules wherein the P1 polypeptide comprisesat least the particular amino acid sequence of the thirteen amino acidcytadhesin epitope described by the present inventors, or the amino acidsequences corresponding to those expressed by phage clones P1-7, P1-9,and P1-10. These amino acid sequences are described by claims 8, 9, 10,and 11, respectively.

Of course, as those of skill in the art will appreciate, the DNAsequences claimed are not required to be actively expressing mycoplasmalP1 polypeptides or in a proper expression vector or expression frame toexpress the polypeptide. In addition, due to the redundancy of thegenetic code, a number of nucleotide sequences may encode the indicatedamino acid sequences. Any such nucleotide sequence is considered to bewithin the scope of the present invention. Moreover, those of skill inthe art will appreciate that, in some cases, conservative amino acidsubstitutions may allow the production of a polypeptide which has aslightly different amino acid sequence than any of those recited in theclaims, but has essentially identical function. Such polypeptides areconsidered to be functional equivalents of the polypeptides describedherein and nucleotide sequences encoding such polypeptides areconsidered to be within the scope of the present claims.

Additional embodiments of the invention are directed towards DNA vectorscomprising any one of the DNA molecules described above as well astoward bacterial strains comprising such recombinant vectors. In a moreparticular embodiment, the bacterial strain is defined as E.coli.

A further embodiment of the invention is directed towards DNA moleculescomprising a DNA sequence which includes at least a tetradecamericportion of the DNA sequence of FIGS. 6A-6N. These DNA molecules arebelieved to possess a number of utilities. First, they may be used ashybridization probes to assist in cloning the P1 gene from M.pneumonias. In addition, it is contemplated that they may be used todetect homologous nucleotide sequences in other mycoplasmal species, andthus aid in cloning of cytadhesin genes from such species, M.genitalium, for example. of course, in some cases they may also be usedto direct synthesis of the polypeptides encoded.

In more specific embodiments of this aspect of the invention, particularnucleotides sequences are claimed. For example, embodiments directed toDNA molecules wherein the DNA sequence comprises nucleotides 178 to 192or 196 to 213 of FIG. 6A are essentially directed to the 14-mer and18-mer probes encoding the amino acids shown in FIG. 2A or 2B,respectively. A further embodiment, directed towards the DNA sequencecomprising nucleotides -70 to 258 of the DNA sequence of FIG. 6A isessentially directed towards the nucleotide sequence of the Hae IIIrestriction fragment described herein. A still further embodiment isdirected to the ECORI/Pst I restriction fragment (nucleotides -204 to911) used by the present inventors to isolate the P1 gene. Of course,yet another embodiment is directed to the DNA fragment containing thecomplete structural gene itself. Additional embodiments are directed tothe DNA molecule encoding the thirteen amino acid cytadhesin epitope(nucleotides 4148-4185) and to DNA sequence comprising the DNA insertsof phage clones P1-7 (nucleotides 4067-4185), P1-9 (nucleotides4148-4881), and P1-10 (nucleotides 4202-4881), respectively. Yet still afurther embodiment is directed to a DNA sequence comprising at least thetranscribed portion of the DNA sequence of FIGS. 6A-6N. Yet still afurther embodiment is directed to the DNA sequence of FIGS. 6A-6N.

It will be readily apparent to those skilled in the art that FIG. 6depicts both the coding and non-coding strand of the P1 DNA. It shouldbe expressly pointed out that the claims of the present invention arenot limited to double stranded DNA molecules, but are intended toencompass both double stranded and single stranded molecules, since, inmany systems, either strand may be used as a nucleotide probe and onestrand may easily be produced from its complementary strand. The onlyrequirement is that the DNA molecule include the specified nucleotidesequence.

In addition, although certain embodiments refer only to DNA molecules,those of skill in the art will also appreciate that the DNA sequences ofthe present invention can easily be transcribed into a corresponding RNAmolecule. Therefore, RNA molecules corresponding to the DNA sequences ofthe present invention are considered to be functional equivalents ofsuch DNA molecules and are intended to be encompassed by the presentclaims.

Additional embodiments of the invention relate to DNA molecules capableof hybridizing to the recombinant insert of the 6 kbp EcoRI fragmentdesignated plasmid pMPM P1 under selected hybridization conditions, saidmolecules suitable for use as hybridization probes. For example, oneembodiment is directed toward a DNA molecule capable of hybridizing tothe recombinant insert of plasmid pMPM P1, obtainable from ATCC#67560under moderately stringent hybridization conditions while anotherembodiment is directed toward a DNA molecule capable of hybridizing tothe recombinant insert of plasmid pMPM P1, obtainable from ATCC#67560under stringent hybridization conditions. For the purposes of thepresent invention, such conditions are described as moderately stringentin that they allow detection of a nucleotide sequence at least 14nucleotides in length having at least approximately 75% homology withthe sequence of the nucleotide probe used. Stringent hybridizationconditions are defined as conditions wherein the probe detectsnucleotide sequences at least 14 nucleotides in length having a homologygreater than about 90%. The conditions necessary for hybridization of aparticular probe to a particular nucleotide sequence having a specifieddegree of homology may be determined by referring to Nucleic AcidHybridization, A Practical Approach, Hames and Higgins, eds., IRL Press,Oxford and Washington, 1985, or Wood, et al., PNAS, 82:1585-1588 (1985),both incorporated herein by reference.

In addition, claims are directed toward recombinant DNA vectorscomprising the claimed DNA molecules as well as bacterial cellscomprising such recombinant vectors. In a more particular embodiment,the bacterial cells are defined as E.coli.

The invention also includes polypeptide fragments of M. pneumoniashaving M. pneumonias P1 cytadhesin epitopes. More specific embodimentsare directed toward polypeptides further defined as being capable ofimmunospecifically binding to monoclonal antibody 6E7, ATCC#HB 8420.Similarly, an additional specific embodiment is directed towardspolypeptides defined as capable of immunospecifically binding tomonoclonal antibody 5B8, ATCC# accession number HB 9586, deposited underthe Budapest Treaty with the American Type Culture Collection,Rockville, Maryland. Of course, where the recombinant polypeptide isencoded by a DNA sequence inserted in a particular type of expressionvector, the polypeptide will be expressed as a fusion protein. Forexample, when lambda gt11 is used as an expression vector, the M.pneumonias polypeptide will be expressed in the form of abeta-galactosidase fusion protein. Although such fusion proteins areconsidered to be polypeptides and, thus, intended to be encompassed bythe claims of the present, invention, one embodiment is specificallydirected to fusion proteins.

Additional claims are directed towards polypeptides comprising specificsequences as outlined in the claims. of course, recombinant polypeptidesare included within the scope of the present invention. Moreover,synthetic polypeptides can be prepared from known amino acid sequences.The present invention is also meant to encompass any syntheticpolypeptide comprising the claimed amino acid sequences or polypeptideshaving conservative amino acid substitutions and essentially identicalfunction. Additional embodiments of the invention relate to vaccinescomprising such polypeptides and methods for inducing resistance to M.pneumonias infection. Still further embodiments relate to diagnostickits comprising polypeptides having cytadhesin epitopes.

More specifically, the invention provides for a cytadhesin polypeptidecorresponding biologically to that produced by clones P1-7, P1-9 orP1-10, ATCC#40836, 40385 or 40384, respectively. For the purposes of thepresent invention, a cytadhesin polypeptide corresponding biologicallyto a polypeptide encoded by an identified recombinant vector isconsidered to be any polypeptide having similar or identical function tothat encoded by the specified recombinant vector. Such peptides mayinclude synthetic peptides, including synthetic peptides having aslightly different amino acid sequence but essentially similar function.In a more specific embodiment, the polypeptides are further defined asM. pneumonias P1 polypeptides.

With even more particularity, the invention provides for a number of DNAmolecules comprising a recombinant DNA vector which includes therecombinant inserts of phages P1-7, P1-9, P1-10, ATCC#40586, ATCC#40385,ATCC#40384, respectively. The invention also includes a recombinant DNAvector which includes a recombinant insert of plasmid pMPM P1,ATCC#67590 (pending). Bacterial strains comprising recombinant vectorswhich include such inserts are also included.

Finally, yet another feature of the present invention relates to amethod for screening mycoplasmal DNA for DNA sequences that correspondto those of M. pneumonias P1, using the novel nucleotide sequences ofthe present invention. This method essentially comprises fractionatingmycoplasmal DNA to produce DNA fragments; separating the DNA fragmentsaccording to their sizes or molecular weights; hybridizing the DNAfragments with DNA molecules provided by the present invention; andidentifying at least one fragment which hybridizes to said DNA moleculesby means of a label.

Of course, the method will prove useful for isolation of M. pneumoniasDNA sequences. However, it may also be useful for screening otherMycoplasmal species for homologous genes encoding M. pneumonias P1. Forexample, the present inventors have observed that portions of the M.pneumonias P1 gene are homologous to a gene from M. genitalium.Therefore, one embodiment of the present invention relates to screeningof M. genitalium. The specificity of the novel nucleotides probes willdepend, in part, on the hybridization conditions used. For example,where one desires to isolate nucleotide sequences encoding proteinshomologous but not identical to M. pneumonias P1, less stringenthybridization conditions should be used.

Methods which have proved particularly useful in fragmenting the DNAutilize restriction enzyme digestion or mechanical shearing. However,restriction enzyme digestion was utilized for the practice of thepresent invention. More particularly, the invention provides fordigestion of the mycoplasmal DNA with the restriction enzyme EcoRI.

The fragmented DNA can be separated into recognizable patterns usingvarious methods, the most useful of which take advantage of the varyingsizes of discrete DNA fragments. For example, DNA fragments can beseparated according to molecular weight by velocity sedimentationthrough a density gradient or, by molecular size exclusionchromatography. However, for purposes of the present invention, thepreferred technique is to separate the DNA fragments by electrophoresisthrough an agarose or polyacrylamide gel matrix.

The P1 hybridization probe can be conveniently labeled with radioactivenucleotides which allow for ready visualization of the hybridized DNA byautoradiography. Of course, other labeling techniques, including heavyisotopes or biotinylation, may also be used.

It should also be appreciated there is also no absolute requirement thatthe hybridization probes be derived from cloned M. pneumonias P1 DNA.Since the present invention provides the complete gene sequence of M.pneumonias P1, various oligonucleotide probes can be syntheticallyprepared on the basis of the disclosed sequence.

The substantially purified DNA molecules, recombinant DNA cloningvectors, recombinant cells, and recombinant proteins of the presentinvention may be used to prepare M. pneumonias P1 polypeptide fragmentsor fusion proteins suitable for use as vaccines or reagents for use indiagnostic kits. Furthermore, the substantially purified DNA sequencesof the present invention are likely to prove useful as hybridizationprobes for selectively isolating mycoplasmal cytadhesin genes.Modification of the products of the present invention so as tofacilitate their utility in these or other areas is considered to bewell within its scope.

CHARACTERISTICS OF DEPOSITED MICROORGANISMS

Recombinant lambda gt11 vectors P1-7, P1-9, and P1-10 comprising clonesP1-7, P1-9, and P1-10, respectively. These clones comprise lambda gt11bacteriophages having a mycoplasmal DNA sequence ligated into the EcoRIsite within the beta-galactosidase gene.

E.coli HB101 comprising a recombinant pUC 19 plasmid vector having amycoplasmal DNA insert approximately 6 kbp in length ligated into theEcoRI site (plasmid pMPM P1) has been deposited under the BudapestTreaty with the American Type Culture Collection, Rockville, Maryland,and assigned ATCC accession # 67560.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1--SDS-polyacrylamide gel electrophoresis analysis of proteinsamples during the purification of P1.

FIG. 1 (A) total protein extract from M. pneumonias. Arrow indicates theposition of P1;

FIG. 1 (B) same sample from FIG. 1 (A) after a single passage throughthe anti-P1 affinity column;

FIG. 1 (C) protein eluted from the anti-P1 affinity column; and

FIG. 1 (D) P1 after preparative gel electrophoresis and electroelution.Proteins were separated by 7.5% polyacrylamide gel electrophoresis andstained with Coomassie blue.

FIG. 2--The N-terminal 18 amino acid sequence of protein P1 and the14-mer and 18-mer oligonucleotide probes designed to hybridize to the P1gene. The 14-mer covers amino acids 1 to 5 and the 18-mer covers aminoacids 7 to 12. X=ACGT.

FIG. 3--M. pneumonias DNA (12 ug/lane) digested with differentrestriction enzymes and separated by 0.7% agarose gel electrophoresis.

FIG. 3 (A) standard;

FIG. 3 (B) EcoRI;

FIG. 3 (C) Hae III;

FIG. 3 (D) Pst I;

FIG. 3 (E) Hind III;

FIG. 3 (F) BamHI;

FIG. 3 (G) Kpn I; and

FIG. 3 (H) Sal I.

FIG. 4--Southern blot analysis of M. pneumonias genome. M. pneumoniasDNA was digested with Hind III, separated by 0.7% agaroseelectrophoresis and transferred to nitrocellulose paper according to themethod of Southern (Mizusawa, et al., Nucleic Acids Res., 14:1319-1324(1986)). The nitrocellulose strip was then hybridized to the 14-mer (A)and 18-mer (B) probes labeled with ³² P. A single band (4.3kb)hybridizes to both probes (arrow).

FIG. 5--Restriction enzyme map of the P1 gene. The first clone (62A)contains the 4.3 kb Hind III piece, and the second clone contains the 6kb EcoRI piece. Both the 14-mer and 18-mer probes hybridize to the DNAat a site very close to the first Sma I site. The cross-hatched boxrepresents the P1 structural gene.

FIGS. 6A-6N-- complete nucleotide sequence and deduced amino acidsequence of the P1 gene. Both the coding and non-coding strand is shown.The presumed starting codon of P1 (ATG) is numbered as 1. In the 5'flanking region, the possible promoter elements (-10 and -35) areunderlined. The 18 amino acids which match those determined by proteinsequencing of P1 are boxed (nucleotides 178-231). In the 3' flankingregion, a sequence with dyad symmetry, which may be a terminationsignal, is indicated by the arrows and the "*" indicates mismatchedsequences in this sequence. The complete P1 gene contains 4881nucleotides coding for a protein of a calculated 176,288 daltons whichincludes an apparent leader peptide (see text).

FIG. 7--Plot of hydrophilicity value versus sequence position of P1according to the method of Hopp and Woods, Proc. Natl. Acad. Sci.,U.S.A., 78:3824-3828 (1981). Hydrophilicity values are averaged over sixamino acids through the length of P1; highest positive values representcharged hydrophilic regions.

FIG. 8--Location of the ten lambda gt11 clones within the P1 structuralgene. The predicted fusion protein size and DNA insert size of eachclone are given. Molecular weight values of the M. pneumonias fusionproteins were calculated by subtracting the value of thebeta-galactosidase protein (116 kD). indicates the location anddimension of the insert size. The numbers indicate nucleotidesencompassed by each clone. A "t" indicates that the clone extendsthrough the end of the P1 gene. As indicated in text, a TGA stop codonexists just downstream from the EV site.

FIGS. 9A-9B Gene sequence and deduced protein sequence of epitopesinvolved in cytadherence by M. pneumonias. The 13 amino acids withinwhich one epitope is located are underlined. Symbols corresponds to thefollowing: , start of clone P1-7; , end of clone P1-7; *, start of cloneP1-9; and ∇, start of clone P1-10. The stop codon is indicated by thebox.

FIG. 10--Hybridization of ³² P-labeled M. pneumonias insert DNA fromclone P1-7 to M. pneumonias genomic DNA digested with EcoRI (lane A),Hind III (lane B), Pst I (lane C), Sac I (land E), and Sma I (lane E).Molecular weights in kb are shown at the left.

FIG. 11--Immunoblot of cytadhesin fusion proteins using anti-P1 MAbs.Lane A represents total M. pneumonias proteins reacted with a pool ofthe two MAbs designated 5B8 and 6E7 (see text). Lane B is thebeta-galactosidase protein reacted with a monoclonal Ab tobeta-galactosidase (Promega Biolab, Madison, Wis.). Lanes C and D areclones P1-7 and P1-9, respectively, reacted with MAb6E7. Lane E is cloneP1-10 reacted with MAb5B8.

FIGS. 12-I, 12-II, 12-III-- Immunophage blot of the ten different clonesreacted with acute (I) and convalescent (II,III) gsera of patientsinfected with M. pneumonias. Numbers 7, 9, and 10 indicate clones P1-7,P1-9, and P1-10, respectively.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The present invention describes the isolation and nucleic acid sequenceof the gene encoding the P1 cytadhesin protein from M. pneumonias, theamino acid sequence of P1, and production of highly antigenic P1polypeptides, including fusion proteins.

The present invention is disclosed in terms of two general approachesemployed by the inventors to isolate clones and identify nucleic acidsequences encoding M. pneumonias P1 protein, or highly antigenic M.pneumonias polypeptides. The first general approach is primarilydirected toward isolating, cloning, and sequencing the complete M.pneumonias P1 gene, while the primary goal of the second approach is toidentify particular nucleotide sequences encoding the functionalcytadhesin domains of P1 and to produce antigenic cytadhesinpolypeptides suitable for use as diagnostic reagents or vaccines.

As indicated earlier, past attempts to clone the P1 gene were found tobe generally unsatisfactory. This failure was due, at least in part, tolack of a suitable method for unequivocally demonstrating that aparticular cloned DNA sequence actually represented the P1 gene.Fortunately, the present inventors have now discovered a techniqueallowing the complete structural P1 gene to be isolated and cloned. TheP1 gene has now been completely sequenced and the nucleotide sequenceunequivocally established as the structural P1 gene. In addition, theamino acid sequence of the complete P1 protein has been deduced from thenucleotide sequence.

Accordingly, the general approach described below represents aparticularly preferred approach for obtaining recombinant DNA clonescontaining the complete P1 gene. However, as illustrated below, themethod has also been successfully used for cloning partially complete P1genes.

The technique described below, disclosed for the first time by thepresent application, is one preferred method for obtaining recombinantDNA molecules and clones of the present invention. of course, variationsof this method may also allow the gene to be cloned successfully. It isalso possible that other techniques could be successfully used to cloneM. pneumonias P1. Any M. pneumonias P1 gene cloned by such procedures isconsidered to be within the scope of the present invention, unless theclaims provide otherwise.

In general, recombinant clones produced in accordance with the presentinvention are made by first isolating mycoplasmal DNA. Any mycoplasmaencoding the P1 protein, may be used as a source of DNA. However,virulent strains of M. pneumonias are preferred. These strains include,but are not limited to, M. pneumonias isolated from infected humans oranimals, as well as defined strains maintained as laboratory cultures.The strain M. pneumonias M129 is particularly preferred.

A number of methods for extracting DNA from prokaryotic organisms areknown which may, with possible routine modifications within ordinaryskill in the art, be used to extract mycoplasmal DNA. A preferred methodgenerally comprises lysing the organisms in a lysing buffer, forexample, sodium dodecyl sulfate in phosphate-buffered saline, extractingthe DNA from the lysed cell mixture with a suitable organic solvent suchas phenol, n-butanol or chloroform and reprecipitating the DNA with asuitable reagent such as ethanol, ethanol-acetate or isopropanol.

The extracted DNA is then fragmented. Any of a number of techniquessuitable for producing DNA fragments, such as mechanical shearing orpartial or complete restriction enzyme digestion, may be used. However,where a complete clone of the structural gene is desired, it isimportant to fragment the DNA so as to produce fragments at least about4.75-5.0 kb.

In general, digestion with restriction enzymes is a preferred method offragmenting the mycoplasmal DNA. Although any of a number of restrictionenzymes may be used (for example, see those listed in MOLECULAR CLONING,Maniatis, et al., Cold Spring Harbor Laboratory, Cold Spring Harbor,N.Y., pp. 93-103) under properly selected digestion conditions, thepresent inventors have discovered that the M. pneumonias genome containsan EcoRI site on either side of the P1 structural gene, but not withinthe gene itself. Therefore, when EcoRI is used as a restriction enzyme,complete digestion of M. pneumonias DNA will produce a single DNAfragment containing the entire structural gene. For this reason, it isespecially preferred that the M. pneumonias DNA be digested tocompletion with ECORI in cases where a full length structural gene isdesired.

Conversely, if one desires to obtain a fragment containing only aportion of the P1 structural gene, complete digestion with Sal I, Pst,BamHI, Kpn I, Sma I, Sac I, Hind III, or Eco RV may be used since the P1structural gene is now shown to contain restriction sites for theseenzymes. Of course, it may be also possible to use these enzymes inpartial digestion procedures in cases where a full length P1 gene isdesired.

The fragmented DNA can be separated into a recognizable pattern usingvarious methods, the most useful of which take advantage of the varyingsizes of the discrete DNA fragments. For example, DNA fragments can beseparated according to molecular weight by velocity sedimentationthrough a density gradient, or by molecular size by gel exclusionchromatography. However, according to a technique preferred for thepurposes of the present invention, DNA fragments are separated byelectrophoresis through an agarose gel matrix and then transferred to anitrocellulose sheet so that an exact replica of the DNA fragments inthe gel is transferred to the nitrocellulose sheet.

A specific labelled probe is then applied to the nitrocellulose sheetunder selected hybridization conditions so as to hybridize withcomplementary DNA fragments localized on the sheet. Although variouslabels known to those of skill in the art may be employed, a radioactiveprobe is preferred. The sheet may then be analyzed by autoradiography tolocate particular fragments which hybridize to the probe.

Various hybridization probes may be used to detect the P1 structuralgene. For example, in experiments performed in conjunction withdevelopment of the present invention, the N-terminal amino acid sequenceof the P1 protein was determined and two oligonucleotides based on thissequence (the 14-mer and 18-mer probes described in Example I) wereutilized. In subsequent experiments, an EcoRI/Pst I restriction fragmentdigested from an incomplete P1 clone (62A) was used as a hybridizationprobe. However, as those of skill in the art will appreciate, any numberof suitable hybridization probes can now be prepared, based onsecluences contained within the complete P1 gene sequence provided forthe first time by the present invention. All that is required is thatthe DNA fragments to be used as probes be of sufficient length to form astable duplex or hybrid with the P1 DNA. Such fragments are said to be"hybridizable" in that they are capable of stable duplex formation.Generally, a DNA fragment of at least fourteen nucleotides in length (atetradecamer) is capable of forming a stable duplex. of course, inpractice, it may be sometimes preferable to use a probe containing morenucleotides or alternatively, to use a battery of individual probes todetect a single fragment that hybridizes to each probe in the battery.

In any event, after DNA of the size range containing the desiredrestriction fragments is identified by the procedure generally describedabove, the fragment may be removed from the gel by a variety oftechniques known to those of ordinary skill in the art (e.g.,electroelution, dialysis, etc.) and cloned into an appropriate vector.Although it is likely that the restriction fragment containing the P1gene could successfully be cloned into several types of vectors, (e.g.,cosmids or phage), it is generally preferred to use a plasmid cloningvector, particularly where the desired restriction fragment is smallerthan 15 kbp.

After construction of recombinant vectors, the vectors are used totransform an appropriate host. In a preferred embodiment, the host is anE.coli cell of a type which is compatible with the selected vector type.However, although the present invention is disclosed in terms of E.colihost/vector systems, other host/vector systems are known in the art andmay be employed where desired. For example, see those described in DNACloning (Vol. II), P. M. Glover, ed., IRL Press, Oxford, Washington,D.C. (1985).

Transformation of host cells by the recombined vector is achieved usingstandard procedures known in the art. For example, where plasmid vectorsare employed, transformation is typically achieved by permeabilizingcompetent cells with calcium and contacting the permeabilized cells withthe recombinant vector DNA. Where bacteriophage vectors are employed,one may additionally choose to package the recombinant phage with phagecoat proteins, which affords direct transformation capability throughcell infection with a resultant increase in transformation efficiency.

Once the cells are successfully transformed with the recombinant vectorDNA, they are culture plated to provide individual recombinant clonalcolonies or plaques, a portion of which may express proteins or peptidesencoded by the M. pneumonias P1 genome. In addition, clones may be usedas a source of M. pneumonias DNA suitable for subcloning, sequencingstudies or use as hybridization probes.

The second general approach utilized by the present inventors relates tocloning and expression of M. pneumonias DNA encoding polypeptides havinga cytadhesin epitope. The polypeptides so produced may be used asdiagnostic reagents or vaccines.

The focus of this approach differs somewhat from that described above inthat it is generally directed toward, isolation and expression of M.pneumonias DNA that encodes a particular functional domain of the P1protein, the domain responsible for cytadherence. In general then, thissecond approach involves fragmenting M. pneumonias DNA by proceduressimilar to those described above and using the fragmented DNA toconstruct an M. pneumonias DNA library or clone bank which is thenscreened with a reagent specific for clones encoding cytadhesinepitopes.

The DNA libraries may generally be constructed in either plasmids orbacteriophage, however, where expression of the cloned gene sequence isdesired, it is preferred that the library be constructed in anexpression vector. The lambda gt11 expression vector is particularlypreferred where expression of the cloned gene is desired because use oflambda gt11 has been found to ameliorate several problems generallyassociated with production of foreign proteins in E.coli. (See Huynh, etal., In DNA Cloning (Vol. I), E. M. Glover, ed., IRL Press, Oxford,Washington, D.C. (1985) and incorporated herein by reference.) Ofcourse, it is contemplated that a number of other vectors could also beused to generate and/or express the M. pneumonias DNA library.

The library may be screened for clones containing the DNA sequencesencoding the cytadhesin domain of P1 by various procedures so long asthe screening reagents used allow isolation of a recombinant DNA cloneencoding at least a portion of the cytadhesin domain. For example, thepresent inventors used monoclonal antibodies previously shown torecognize the cytadhesin binding domain of M. pneumonias P1 (SeeMorrison-Plummer, et al., Infect. Immun., 55:49-56 (1987)). Notably,those antibodies do not react with the DNA clones described by Trevino,et al.

Of course, since the present disclosure describes the nucleic acidsequence of the critical regions of the P1 gene, nucleic acidhybridization probes that selectively hybridize to these regions of theP1 genome may also be used for screening. (For examples of a nucleicacid screening procedure, see Huynh, et al., In DNA Cloning (Vol. I), E.M. Glover, ed., IRL Press, Oxford, Washington, D.C. (1985)). However,where one desires to screen with specific nucleic acid probes, lambdagt10 may be a preferred vector.

Once clones containing the M. pneumonias cytadhesin epitopes areisolated, they may then be expanded and used as a source of M.pneumonias DNA for sequencing studies. The sequence of the DNA insertsof the selected clones can then be compared with the complete DNAsequence of the P1 gene provided for the first time by the presentinvention. In this manner, the cloned inserts can be unequivocallyidentified as encoding all or part of the P1 protein.

DNA or deduced amino acid sequences from a battery of clones may then becorrelated with the antigenic phenotype of the polypeptides produced bysuch clones to precisely map the location of nucleotide sequencesencoding particular antigenic epitopes. Moreover, certain monoclonalantibodies specific for the P1 protein have been shown to inhibitcytadherence of M. pneumonias and, therefore, are specific for thefunctional domain of P1 that mediates cytadherence. When thesemonoclonal antibodies are used for screening, the epitopes involved inmediating cytadherence can be mapped as well.

The recombinant DNA clones encoding all or part of the functional domainresponsible for cytadherence are particularly valuable. First, thepeptides expressed by such clones may be used as immunodiagnosticreagents to detect M. pneumonias infection. More importantly, thepeptides may be incorporated into an antimycoplasmal vaccine. Inaddition, antigenic peptides comprising the cytadhesin specific epitopescan be synthesized, on the basis of the amino acid sequences deducedfrom the mapped nucleotide sequence and used as vaccines or antigens forimmunodiagnostic tests.

Finally, it should be pointed out that, for practical reasons, it mayoften be easier to demonstrate the P1 cytadhesin epitopes using amonocional antibody since polyclonal antiserum will usually containantibody molecules specific for regions of the P1 protein not associatedwith the cytadhesin domain as well as antibody molecules specific forcytadhesin epitopes. However, polyclonal antiserum capable of inhibitingP1 mediated cytadherence may also be used to demonstrate presence of thecytadhesin epitopes by a number of techniques generally known to thoseof skill in the art. For example, selected P1 polypeptides may be usedto extensively adsorb the polyclonal antiserum and adsorbed andnonadsorbed antiserum compared for the ability to inhibit cytadherence.By this procedure, specific polypeptides capable of significantlyreducing the antibody mediated inhibition of P1 mediated cytadherencemay be considered to express cytadhesin epitopes. In addition,cytadhesin epitopes may be demonstrated directly by their ability tocompetitively inhibit P1 mediated cytadherence in any of a number ofexperimental systems commonly used to measure cytadherence, described byMorrison-Plummer, et al., Infect. Immun., 53:398 (1986), or Krause andBaseman, Infect. Immun., 39:1180-1186 (1983).

Although the methodology described herein contains sufficient detail toenable one skilled in the art to practice the present invention, acommercially available technical manual entitled MOLECULAR CLONING(Maniatis, et al., Cold Spring Harbor Laboratory, Cold Spring Harbor,New York) may provide additional details useful to assist practice ofsome aspects of the invention. Accordingly, this manual is incorporatedherein by reference.

The following examples are designed to illustrate certain aspects of thepresent invention. However, they should not be construed as limiting theclaims thereof.

EXAMPLE I Isolation of a Recombinant Clone That Contains a DNA SequenceEncoding M. pneumoniae P1

This example is designed to illustrate the actual steps followed by theinventors in obtaining a specific recombinint clone that contained a DNAsequence encoding the mycoplasma P1 protein. However, this example isnot meant to represent the only procedure for cloning the P1 gene.

A. Culture of Mycoplasma And E. Coli

Virulent hemadsorbing Mycoplasma pneumonias strain M129 in the sixteenthbroth passage was grown at 37° C. in 32 ounce glass prescription bottlescontaining 70 ml of Edward medium (Edward, J. Gen. Microbial., 1:238-243(1947)). Glass adherent mycoplasmas were washed four times withphosphate buffered saline (PBS; pH 7.2) and collected by centrifugation(9,500×g 20 min.). Cells were harvested 72 hours after inoculation andstored at -70° C.

Escherichia coli strain HB101, DH5 alpha, and JM 107 were purchased fromcommercial sources and grown in LB broth (10 g/l Bacto-tryptone, 5 g/lBacto-yeast extract, 10 g/l NaCl, pH 7.5).

B. Purification of P1 Protein by Affinity Chromatography

The P1 protein was purified by antibody affinity chromatographyaccording to the method described by Leith and Baseman, J. Bacteriol.,157:678-680 (1984). Briefly, this method was as follows.

Four anti-P1 monoclonal antibodies secreted by hybridomas(Morrison-Plummer, et al., Infect. Immun., 55:49-56 (1987);Morrison-Plummer, et al., Infect. Immun., 53:398-403 (1986)) werecombined and purified by protein A-Sephadex column chromatography.Anti-P1 affinity columns were prepared by coupling 50 mg of purifiedanti-P1 antibody to 15 ml of cyanogen bromide activated Sephadex gel(Pharmacia, P1 Piscataway, N.J.).

Pellets from 100 bottles of M. pneumonias were suspended in 50 ml of 20mM Tris-HCl (pH 8.0), 0.2% sodium deoxycholate (Fisher Scientific), 0.1%sodium dodecyl sulfate (BDH Chemicals, Poole, England), 10 mM EDTA, and0.2% Triton-X-100 containing 1 mM phenylmethylsulfonyl fluoride.Solubilization of proteins was assisted by passing the cell suspensionthrough successively smaller gauge needles (22 to 27 gauge). Insolublematerial was removed by centrifugation at 100,000×g for 30 minutes.

Solubilized proteins were applied to the affinity column at 4° C. andwashed with 5 column volumes of the same buffer minus sodiumdeoxycholate. Bound protein was eluted with 0.1M acetic acid (pH 3)containing 0.15M NaCl and 0.1% SDS. The eluted protein was immediatelyneutralized with 1.0M Tris and concentrated in a pressureultrafiltration concentrator (Amicon, Denvers, Mass.).

As shown in FIG. 1 (panel's A-C), this procedure selectively enrichedfor the Mycoplasma pneumonias cytadhesin protein P1 (165 kilodaltons).Approximately 400 ug of P1 protein was recovered after theimmunoaffinity step from an initial M. pneumonias extract containing 300mg total protein.

As an additional purification step, the affinity column-purified P1 wasfurther processed by preparative gel electrophoresis through a 5%polyacrylamide-SDS gel. The gel was stained with Coomassie blue and theP1 protein band was cut out of the gel and electroeluted according tothe procedure of Hunkapiller, et al. (In Methods in Enzymology, C. H. W.Hirs and S. N. Timasheff (eds.) pp. 227-236 (1983)). About 60% recoverywas achieved after 24 hours of elution at room temperature in 50 MMammonium carbonate containing 0.1% SDS. The eluted protein was thenprecipitated in 80% methanol to remove SDS. SDS-PAGE analysis of therecovered P1 revealed that the sample contained intact P1 protein (FIG.1D), and the gel was deliberately overloaded to show the purity of thesample. Finally, the purified protein was shown to be P1 since itreacted with anti-P1 monoclonal antibodies in Western blot analyses(data not shown).

C. Determination of the N-Terminal Amino Acid Sequence of P1 Protein andPreparation of Specific Oligonucleotide Probes

The purified P1 protein was sequenced from the amino terminus with a gasphase protein sequencer. Approximately 50 ug of purified P1 was used(300 pmole) for each sequence analysis. Three separate determinationsyielded the sequence shown in FIG. 2.

The N-terminal amino sequence was used to deduce sequences foroligonucleotide probes. Two oligonucleotide probes complementary to allthe possible mRNA combinations encoding different portions of theprotein were synthesized, a 14-mer corresponding to amino acids 1-5 anda 18-mer corresponding to amino acids 7-12 (FIG. 2). The presentinventors used both C and T in the third position of the trypotophancodon of the 18 bp oligonucleotide in order to ensure hybridization withthe probe in the event that M. pneumonias uses TGA (a stop codon inbacterial and eukaryotic systems) rather than TGG to encode tryptophan.The oligonucleotides were synthesized in the Department of Biochemistry,Baylor College of Medicine according to a procedure similar to thatdescribed by Alvarado-Urbina, et al., Science, 214:270-274 (1981),incorporated herein by reference, and purified by electrophoresis in 20%polyacrylamide gel containing 8M urea (Berent, et al., Biotech.,3:208-220 (1985)). For use as hybridization probes, the oligonucleotideswere labeled at the 51' end with Y-P³² -ATP by the T4-polynucleotidekinase reaction (Maniatis, ei al., MOLECULAR CLONING, Cold Spring HarborLaboratory, Cold Spring Harbor, N.Y. (1982), pp. 122-127).

D. Southern Blot Analysis of M. pneumonias DNA

M. pneumonias DNA was prepared from exponentially growing cellsaccording to the following procedure. Pellets of M. pneumonias weresuspended in 2.7 ml of PBS, lysed by the addition of 0.3 ml of 10%sodium dodecyl sulfate (SDS) and incubated with 10 ug of RNase for 30minutes at 37° C. Preparations were extracted three times with an equalvolume of redistilled phenol (equilibrated with 100 mM Tris [pH 8.0] -10mM EDTA [TE]) followed by dialysis overnight at 4° C. against a total of6 liters of sterile TE. Twelve ug of DNA was digested to completion withEcoRI, Hae III, Pst I, Hind III, BamHI, Kpn I or Sal I prior toelectrophoretic separation on 0.7% agarose gels. Gels were stained withethidium bromide and photographed under UV illumination (FIG. 3).

The gels were then analyzed according to the procedure of Southern, J.Mol. Biol., 98:503-519 (1975), incorporated herein by reference.Briefly, DNA was transferred to nitrocellulose filter paper with 20×SSC(0.3M sodium citrate, pH 7.0, 3M NaCl), rinsed once with 6×SSC, thenbaked at 80° C. for 2 hours under vacuum. Filters were prehybridizedovernight at 37° C. in 20 ml of prehybridization solution containing6×SSC, 60 mM sodium phosphate (pH 7.0), 5×Denhardt's solution (bovineserum albumin, polyvinylpyrolidone, Ficoll at 1 mg/ml) and 0.1 mg/ml ofdenatured herring sperm DNA.

Hybridizations with the 14 base pair [bp] and 18 base pair [bp]oligonucleotide probes were carried out for 12 hours in 10 ml ofprehybridization solution plus 10% dextran sulfate and ³² P labeledoligonucleotide probes (3×10⁸ cpm) at 25° C. (14 bp, 14-mer) or 37° C.(18 bp, 18-mer). After incubation, filters were rinsed twice with 6×SSCat 4° C. (30 min. each), then washed twice in wash solution (3Mtetramethylammonium chloride, 50 mM Tris-HC1, pH 8.0, 2 mM EDTA, 0.1%SDS) at the appropriate temperature (14-mer at 37° C. and 18-mer at 45°C.) for 20 min. according to the procedure of Wood, et al., Proc. Nat.Acad. Sci., U.S.A., 82:1585-1588 (1985). After washing, filters wererinsed in 6×SSC at 4° C., dried and exposed to X-ray film using anintensifying screen.

Both probes hybridized to several DNA bands in each digestion, possiblybecause the probes were comprised of a mixture of oligonucleotidesformulated to react with all possible nucleotide sequences that couldencode the 12 N-terminal amino acids. A 4.3 kb Hind III fragmenthybridized most intensely to both the 14-mer and 18-mer (FIG. 4)strongly implicating this DNA fragment as containing the N-terminalsequence of P1.

E. Cloning DNA Fragments Encoding M. pneumonias P1 Protein

To clone the DNA fragment described above, M. pneumonias DNA wasdigested with Hind III, separated by agarose gel electrophoresis, andstained briefly with ethidium bromide. DNA in the 4.3 kb size range waseluted from the gel by electrophoresis onto DE-81 paper, eluted from thepaper with 20 mM Tris-HCl, pH 8.0, and 1.5M NaCl, then precipitated withethanol and redissolved in TE buffer.

The DNA was then ligated into the Hind III site of pUC 9. For thisprocedure, the plasmid was digested with an appropriate restrictionenzyme (Hind III) and the 51' end phosphate removed by calf intestinalalkaline phosphatase according to the procedure described on page 133 ofManiatis, et al., MOLECULAR CLONING, Cold Spring Harbor Laboratory, ColdSpring Harbor, N.Y. (1982). Mycoplasma DNA and vector were mixed at 1:1molar ratio and ligated at room temperature for 4 hours with T₄ DNAligase. After incubation, the reaction was stopped by adding EDTA to 10mM, diluted 5-fold with distilled H₂ O.

The ligated plasmid DNA was then used to transform competent HB101 orDH5 alpha E.coli cells according to the manufacturer's instructions(BRL, Bethesda, Md.). Transformants were selected on LB agar platescontaining 50 ug/ml of ampicillin. About 5,000 transformants wereobtained, of which 200 individual colonies were picked and grownovernight in 5 ml of LB broth containing 50 ug/ml of ampicillin. PlasmidDNA was isolated from overnight cultures by the alkaline lysis method(Ish-Horowicz and Burke, Nucleic Acid Res., 9:2989-2998 (1981)) andanalyzed on agarose gels.

To determine which insert-containing plasmids carried the P1 gene, DNAsfrom about 40 plasmids with inserts in the 4-5 kb range were blottedonto nitrocellulose filters. The filters were then hybridized to the ³²P labeled 14-mer and 18-mer oligonucleotide probes, washed and exposedto film as described above. Three clones hybridized strongly to bothprobes. By restriction endonuclease analysis the three clones containedthe same insert designated 62A (FIG. 5).

The DNA sequence which hybridized to both probes was narrowed to a 350bp Hae III restriction fragment by digesting the 62A plasmid with theHae III, separating the DNA on a 5% polyacrylamide gel, and transferringthe DNA from the gel onto nitrocellulose paper for hybridization witheach individual probe (data not shown). The 350 bp Hae III piece wassubcloned into M13mpl8 and its sequence determined. It contains both the14-mer and 18-mer sequences, and most importantly the DNA has an openreading frame which codes for the 18 amino acids found by sequencing theamino terminus of the P1 protein (FIGS. 6A-6N). Thus, clone 62A wasshown to contain at least a part of the structural gene encoding P1.

However, based upon the location of the sequenced Hae III fragment inthe 62A clone, the 4.3 kb Hind III DNA fragment was not large enough toencode the entire 165 kDa P1 protein. Therefore, an EcoRI/Pst Irestriction fragment from 62A was used to clone a larger DNA fragment.This procedure was performed as follows:

Plasmid 62A was isolated from overnight cultures by the alkaline lysismethod (Ish-Horowicz and Burke, Nucleic Acids Res., 9:2989-2998 (1981))and digested to completion with a mixture containing 500 units EcoRI and500 units Pst I. The resulting restriction fragment was purified byagarose gel electrophoresis, labeled by nick translation (Maniatis, etal., MOLECULAR CLONING, Cold Spring Harbor Laboratory, Cold SpringHarbor, N.Y. (1982), pp. 109-112) and used to probe Southern blots of M.pneumonias DNA digested to completion with EcoRI. This procedure wasperformed essentially as described above, except that the hybridizationconditions were more stringent including a higher temperature ofhybridization and wash (65° C.).

By this procedure, an M. pneumonias DNA fragment approximately 6 kbp wasdetected. Accordingly, DNA in this size range was eluted from an agarosegel of the EcoRI-digested DNA by electrophoresis onto DE-81 paper,eluted from the paper with 20 mM Tris-HC1, pH 8.0, and 1.5M NaCl, thenprecipitated with ethanol and redissolved in TE buffer.

The DNA was then ligated into the EcoRI site of pUC 19, essentially asdescribed above and used to transform E.coli, as described above.Restriction enzyme analysis of the cloned insert indicated that the 6kbp insert overlapped clone 62A and was sufficiently large to encode theentire P1 protein. The restriction enzyme map depicting both the 4.3 kbpHind III fragment and the 6 kbp EcoRI fragment is shown in FIG. 5.

EXAMPLE II Determination of the Complete DNA Sequence of the GeneEncoding Mycoplasma pneumoniae P1 Amino Acid Sequence of the P1 ProteinA. Sequencing of the P1 Gene

DNA sequences were determined by the dideoxy-chain-termination method ofSanger, et al., Proc. Natl. Acad. Sci., U.S.A., 74:5463-5467 (1977). M13sequencing kits were purchased from BRL and the reactions were performedaccording to the manufacturer's instructions except deoxy-7-deaza GTP(Boehringer Mannheim, Indianapolis, Ind.) was used in sequencingreactions in place of dGTP (Messing, et al., Nuc. Acid Res., 9:309-321(1981)). Some DNA fragments were sequenced by subcloning appropriaterestriction enzyme fragments into an M13 phage vector (Messing, et al.,Nuc. Acids Res., 9:309-321 (1981)) and the single strand DNA purifiedfor use as a sequencing template. To sequence the rest of the P1 gene, alarge piece of DNA from the Pst I to the Sal I (see FIG. 5) wassubcloned into an M13 vector and a series of deletions from the 3' endwere generated by treating the double strand DNA with exonuclease IIIaccording to the method of Heinkoff, Gene, 28:351-359 (1981). Subcloneswith progressive deletions were selected for use as sequencingtemplates. Both strands of the entire P1 gene were sequenced. Nucleicacid and protein computer analyses were performed using the Microgenieprogram (Beckman, Palo Alto, Caif.). Comparisons of the P1 DNA anddeduced protein sequences were to the most recent releases of the NIHGenbank DNA sequence database and the National Biomedical ResearchFoundation protein sequence database, respectively.

B. Analysis of the P1 Nucleotide Sequence

The nucleotide sequence of the P1 gene is shown in FIGS. 6A-6N. There isan open reading frame of 4881 nucleotides and at the end of the gene isa TAG stop codon followed by 2 in-frame TAA stop codons 21 and 27 bpdownstream. This sequence could encode a protein of 1627 amino acidswith a calculated molecular weight of 176,288.

The nucleotide sequence includes a possible in frame translationinitiation site, ATG, 177 nucleotides from the P1 N-terminal sequence.There are conventional transcription initiation sites at -35 and -10upstream with a distance of 14 nucleotides between these two consensussequences (Reznikoff, et al., Ann. Rev. Genet., 19:355-387 (1985)), butno ribosomal binding site is observed between -10 and the initiationcodon. This predicts a protein with an extension of 59 amino acids fromthe N-terminus. Another possible translation initiation codon is the GTG(Gold, et al., Ann. Rev. Microbiol., 35:365-403 (1981)) at position 91.Use of this initiation site would predict a 28 amino acid precursor.

The open reading frame contains the 18 amino acids identified by gasphase sequencing (FIGS. 6A-6N, Box). Comparison of the gas phasesequence with the nucleotide sequence demonstrates that the inventors'hunch that M. pneumonias might use this codon to encode tryptophan wascorrect.

Moreover, it was observed that the 18 amino acids are found at position60-77 of the deduced protein instead of at the amino terminus of theopen reading frame. The reason for this apparent discrepancy could wellbe that P1, like many outer membrane proteins, is initially synthesizedas a precursor (Oliver, Ann. Rev. Microbial., 39:615-648 (1985)).Consistent with this hypothesis is the observation that the extra 59amino acids found at the amino terminus of the deduced protein appearlike a signal peptide; they include positively charged amino acidsfollowed by a stretch of hydrophobic amino acids (Oliver, Ann. Rev.Microbial., 39:615-648 (1985)). If protein P1 is indeed synthesized as aprecursor and processed into a mature protein, then the molecular weightof the mature protein would be 169,758 which is very close to the 165kDa reported earlier [Baseman, et al., J. Bacteriol., 151:1514-1522(1982); Krause, et al., Infect. Immun., 35:809-817 (1982); Leith andBaseman, J. Bacteriol., 157:678-680 (1984); and Morrison-Plummer, etal., Infect. Immun., 55:49-56 (1987)] and almost identical to the value(168 kDa) determined by Jacobs, et al., J. Clin. Microbiol., 23:517-522(1986) on SDS-PAGE.

Other relevant features of the sequence include a typical enbacterialpromoter (Reznikoff, et al., Ann. Rev. Genet., 19:355-387 (1985)) forRNA polymerase which is upstream of the first ATG codon, atapproximately -35 and -10. Also, a not-so-perfect invert repeat sequenceis detected 19 base pairs downstream from the TAG stop codon. Theinverted repeat sequence is a common feature of an RNA terminator(Rosenberg and Court, Ann. Rev. Genet., 13:319-353 (1979)). However, notypical ribosomal binding site is observed between -10 and theinitiation codon.

C. Determination of the Amino Acid Sequence of the P1 Protein

The complete amino acid sequence of the M. pneumonias (FIGS. 6A-6N) P1protein was predicted from the DNA sequence, also shown in FIGS. 6A-6N.The predicted amino acid sequence is consistent with availableinformation about protein P1; the predicted molecular weight of P1approximates the reported values; and the predicted N-terminal aminoacid sequence fits exactly with the gas phase sequence analysis ofpurified P1 protein. The predicted P1 sequence contains more basic aminoacids (Arg+Lys+His=169) than acidic (Asp=Glu=143) (isoelectric focusingdata shows that P1 has an isoelectric point at a basic pH). Thepredicted P1 contains no cysteine and thus has no intramoleculardisulfide bonding, a finding which correlates with the previousobservation that the P1 position in polyacrylamide gels is not changedafter exposure to sample buffer containing reducing agents.

By referring again to FIGS. 6A-6N, it can be seen that the predicted P1protein has several other interesting features: a) it contains highpercentages of hydroxy amino acids (17.7% are serine and threonine); andthe high proline content (13 of 26 amino acids) at the carboxy terminusis unusual and may place structural restraints on the protein and assistin regulating the topological organization of the cytadhesin in themembrane [Baseman, et al., J. Bacteriol., 151:1514-1522 (1982); Baseman,et al., In Molecular Basis of Oral Microbial Adhesion, S. E. Mergenhagenand B. Rosan (eds.), (1985); Kahane, et al., Infect. Immun., 49:457-458(1985); and Krause, et al., Infect. Immun., 35:809-817 (1982)].

It should be noted that FIGS. 6A-6N display the actual nucleotidesequence determined by sequence analysis of the 6 kbp EcoRI fragment(plasmid pMPM Pl) insert obtainable from ATCC#67560. As those of skillin the art will appreciate, due to the redundancy of the genetic code,numerous other nucleotide sequences may be constructed which code forthe same amino acid sequence. Therefore, any nucleic acid sequenceencoding for the M. pneumonias P1 protein as depicted in FIGS. 6A-6N ismeant to be included within the scope of the present invention. Thisincludes nucleotide sequences containing either the mycoplasmal (TGA) ortraditional (TGG) tryptophan codons.

D. Homology between M. pneumonias and Other Proteins Having Known AminoAcid Sequences

The deduced amino acid sequence for the P1 protein was compared to knownamino acid sequences listed in the National Biomedical ResearchFoundation protein sequence database. This analysis revealed that thepredicted P1 sequence is homologous to coat protein A of bacteriophageIke (protein P1 amino acid numbers 1308 through 1322 compared tobacteriophage amino acid numbers 240 through 254, 73.3% homology;257-290 vs. 231-264, 41.2% homology), protein 3A of Brome Mosaic virus(956-979 vs. 133-159, 52% homology), coat protein vp2 and vp3 of mousepolyomavirus (733-746 vs. 24-38, 66.7% homology), and coat protein Aprecursor of bacteriophage fd, M-13 and F1 (1296-1330 vs. 245-280, 51.3%homology). The 1290-1350 region of P1 also shares extensive homologywith cytoskeletal keratin of mammalian species. In addition, two regionsof P1 share extensive homology with human fibrinogen alpha chainprecursor (337-352 vs. 338- 354, 70.6% homology; 822-852 vs. 544-565,59.1% homology). It is fascinating that parts of the P1 sequence arehomologous to specific viral coat proteins, mammalian cytoskeletalkeratin and to human fibrinogen alpha chain precursor. These findingsmay help explain observations of autoimmune-like mechanisms ofphysiopathology associated with mycoplasma disease (Biberfeld, S., Clin.Exp. Immunol., 8:319-333 (1971); Wise and Watson, Infect. Immun.,48:587-591 (1985)).

E. Analysis of Individual Antigenic Determinants Within the P1 Moleculeby Hydrophillicity Plotting

Cytadhesin P1 is strongly immunogenic and the appearance of anti-P1antibodies correlates with resolution of the atypical pneumonias inducedby M. pneumonias. Therefore, the recombinant P1 protein or selectedpeptides derived from the P1 protein provide attractive vaccinecandidates. The present inventors have performed experiments directedtowards mapping individual antigenic sites within the protein. oneapproach is used to map the antigenic sites and is described below.

In general, antigenic sites are usually hydrophilic. Therefore, wherethe amino acid sequence of a protein is known, hydrophilicity plots maybe constructed which allow one to predict the location of antigenicdeterminants (Hopp and Woods, Proc. Natl. Acad. Sci., U.S.A.,78:3824-3828 (1981)). Hydrophilicity plotting of the predicted M.pneumonias P1 sequence was performed using the Microgenie programobtained from Beckman (Palo Alto, Calif.). This analysis revealedpotential antigenic sites (FIG. 7) at positions 240-260, 280-304,314-333, 450-479, 680-690, 746-767, 898-913, 1244-1260, and 1476-1485.

EXAMPLE III Expression of the Complete Recombinant P1 Protein

The following prophetic example is intended to describe methods by whichthe P1 gene could be expressed to provide a complete P1 protein.

The P1 protein could be expressed by ligating the piece of DNA thatincludes the first Hind III site through the second EcoRI site (see FIG.5) to a mycoplasma compatible vector, such as E.coli plasmid pAM120,then transforming fast growing mycoplasma species (such as Acholeplasma)for production of large quantities of P1. (See Dybvig, K., et al.,Science, 235:1392 (1987), which is incorporated herein by reference.)

The P1 gene could also be modified to express whole P1 in E.coli. Allthe UGA codons in the structural gene of P2 could be changed into UGG bysite specific mutagenesis. See Shortle, D., et al., Meth. in Enzymol.,100:457 (1983), which is incorporated herein by reference. Then apowerful E.coli promoter such as the lac promoter could be ligated tothe P1 gene to overproduce P1. Alternatively, an E.coli strain with UGAsuppressor phenotype (Raftery, L., et al., J. Bacteriol., 158:849(1984), which is incorporated herein by reference) could be used as hostto express the unmodified P1 gene.

Also, the P1 gene promoter is a unique mycoplasma promoter which can beused for the expression of other proteins in mycoplasma species.

EXAMPLE IV Cloning, Sequecing, and Expression of Nucleotide SequencesEncoding the Functional Cytadhesin Binding Domain of M. pneumoniae

This example describes the construction of the lambda gt11 recombinantDNA expression library of M. pneumonias used to characterize the P1domain involved in cytadherence. In general, clones expressing P1epitopes were identified by screening the library with two anti-P1monoclonal antibodies known to block M. pneumonias attachment toerythrocytes (RBCS) and respiratory epithelium.

A. Construction of the Lambda gt11 Library 1. Bacteria, Vector, andRestriction Enzymes

M. pneumonias strain M129-B16 was cultured as described in Example I.E.coli Y1088 (American Type Culture Collection (ATCC#37195), Y1089(ATCC#37196), and Y1090 (ATCC#37197) were cultured in LB medium. Thesecell lines are available through the American Type Culture Collection orfrom Clontech Laboratories (Palo Alto, Calif.).

Lambda gt11 DNA arms and phage extracts were purchased from PromegaBiotech (Madison, Wis.). Enzymes used for constructing the genomiclibrary were from New England Biolabs (Beverly, MS); restriction enzymeswere from BRL (Gaitherburg, Md.).

2. Construction of the M. pneumoniae Genomic Library in Lambda gt11

M. pneumonias strain M129-B16 genomic DNA library was constructed in theexpression vector lambda gt11 according to general procedures describedby Young and Davis, Proc. Natl. Acad. Sci., 80:1194-1198 (1983) andScience, 222:778-782 (1983) incorporated herein by reference.

More specifically, mycoplasmal DNA was extracted and fragmented asdescribed in Example I, but using mechanical shearing in place ofrestriction endonucleases.

The sheared DNA was then ligated to EcoRI linkers, and these DNAfragments were ligated into the EcoRI site in lambda gt11 armsessentially as described by Young and Davis, Proc. Natl. Acad. Sci.,80:1194 (1983) and Science, 22:778 (1983). Briefly, this procedurecomprises incubating the vector DNA and the M. pneumonias DNA fragmentsat high vector/insert ratio of 2:1 in ligation buffer (0.066 M Tris-HCl,pH 7.5; 5 mM MgCl₂ ; 5 mM DTT; 1 mM ATP) with 1U T4 DNA ligase at 12° C.for 2-16 hours.

Recombinant DNA was packaged to provide viable phage according toinstructions provided by the commercial supplier of the phage arms andphage extracts (Promega Biotech, Madison, Wis.). Alternatively,packaging extracts may be prepared and packaging reactions carried outaccording to protocols described on pages 256-268 of MOLECULAR CLONING.

The phage may then be titered by plating a small number of phage fromthe packaging mix (about 100) on E. coli Y1088 at 42° C., using 2.5 mlLB soft agar (pH 7.5) containing 40 ul of 40 mg/ml×gal and 40 ul of1MPTG for a 90 mm Petri dish. Plaques produced by the parental lambdagt11 phage are blue, while plaques produced by the recombinant phage arecolorless. (In a few cases, particular recombinant phage plaques willproduce a slight amount of blue color.)

The library may then be amplified by plating out the library at adensity of 10⁶ p.f.u. per 150 mm Petri dish, using 600 ul of Y1088plating cells per dish and fresh LB plates and incubating at 42° C.Plate stocks may be prepared as described by Davis, et al., BacterialGenetics, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.(1980).

Alternatively, it is possible to screen the lambda gt11 library withoutamplification. For this procedure, 0.1 ml Y1088 plating cells areinfected with ≦10 plaque forming units at 37° C. for 15 minutes. Then0.5 ml of Y1090 plating cells and 7.5 ml LB soft agar are added. Themixture is poured into a two-day old 150 mm LB plate (pH 7.5).

B. Screening Lambda gt11 M. pneumonias DNA Libraries with MonoclonalAntibody Probes

The M. pneumonias DNA phage library was screened with a pool of twoanti-P1 monoclonal antibodies directed against unique M. pneumoniasepitopes involved in cytadherence. The screening procedure was generallyperformed as follows.

E. coli Y1090 was grown to saturation in LB (pH 7.5) at 37° C. and 0.6ml of the Y1090 culture was mixed with up to 10⁵ p.f.u. in lambdadiluent for each plate. The phage were absorbed to the cells at 37° C.for 15 minutes. Then 7.5 ml of LB soft agar (pH 7.5) was added to theculture and the mixture was poured onto an LB plate (pH 7.5). The plateswere incubated at 42° C. for 3-4 hours and then placed at 37° C. Eachplate was then overlayed with a dry nitrocellulose filter disk which hadbeen saturated in 10 mM IPTG in water. The plates were then incubatedfor an additional 2-3 hours at 37° C. and removed to room temperature.The filters were then removed from the plate and the following stepswere performed.

First, the filters were rinsed briefly in TBS (50 mM Tris-HCl, pH 8.0,150 mM NaCl) and incubated in TBS plus 20% fetal calf serum for 15-30minutes. The filters were then incubated in TBS plus 20% fetal calfserum plus a mixture containing 1 ug/ml MAb6E7 and 2 ug/ml MAb5B8 forone hour. Preparation of these antibodies is described in Plummer, etal., Infect. Immun., 53:398-403 (1986), incorporated herein byreference.

The filters were then washed in TBS for 5-10 minutes, washed again inTBS plus 0.1% NP-40 for 5-10 minutes, rewashed in TBS alone for 5-10minutes, rinsed briefly in TBS plus 20% fetal calf serum, andtransferred to TBS plus 20% fetal calf serum containing horseradishperoxidase-conjugated to goat anti-mouse immunoglobulin. The filterswere then washed again in TBS, TBS plus 0.1% NP-40, and TBS. The filterswere dried and 4-chloro-1-naphthol was used as substrate to develop theimmunoblots.

When the lambda gt11 M. pneumonias genomic library was screened with thetwo monoclonal antibodies, ten independent clones that produced strongsignals were isolated. Eight of the clones reacted with both monoclonalantibodies, one clone (P1-7) reacted only with MAb6E7 and another clone(P1-10) reacted only with MAb5B8. The nucleotides encompassed by each ofthese clones is indicated by FIG. 8.

C. Analysis of the Recombinant Phage Clones

The following experiments were performed in order to furthercharacterize the mycoplasmal proteins produced by the recombinant phage.Positive signal-producing phage were grown in E. coli Y1090 as describedin MOLECULAR CLONING, pp. 64-65. DNA was extracted by a rapidsmall-scale plate lysate method using 2 units of EcoRI to excise the M.pneumonias DNA inserts essentially as described in MOLECULAR CLONING,pp. 371-372.

1. Sequencing of the M. pneumonias DNA Inserts

DNA sequences of the recombinant phage inserts were determinedessentially as described in Example II. The results of this analysis areshown in FIGS. 8 and 9A-9B. By comparing the sequences of these clonesto the complete P1 gene sequence (FIGS. 6A-6N), the cytadhesin bindingdomain of the M. pneumonias P1 protein was mapped to the C-terminalregion of the P1 gene. The sequences of three clones were of particularutility in further mapping antigenic epitopes of P1. These clones wereP1-7, P1-9, and P-10. As shown in FIGS. 9A-9B, clone P1-7 starts atposition 4067 and ends at position 4185; clone P1-9 starts at position4148 and extends beyond the end of the P1 gene. These two clones bothcontain nucleotides 4148-4185. These nucleotides code for a P1polypeptide thirteen amino acids in length, the thirteen amino acidsthat contain the epitope reactive with the cytadherence-blocking MAb6E7.Clone P1-10 starts at position 4202 and extends beyond the P1 gene. Thisclone is nonreactive with MAb6E7, yet shares a stretch of nucleotidesthat overlap with clone P1-9; further demarcating the thirteen aminoacid cytadherence related epitope.

Therefore, a key domain of adhesin P1 that mediates the cytadherence ofvirulent M. pneumonias to respiratory epithelium has been mapped to athirteen amino acid region located in the C-terminal end of the P1molecule. In addition, the present studies have established that asecond cytadhesin epitope, recognized by MAb5B8, is C-terminal toposition 4202. Therefore, the C-terminal end of the P1 protein appearsto be the primary effector region of the P1 molecule. It is interestingthat the carboxy terminus of the P1 protein is proline rich (13 of thelast 26 amino acids are proline). This hydrophobic domain may functionto anchor the carboxy terminal end of the P1 molecule in the M.pneumonias membrane.

2. The Thirteen Amino Acid Cytadhesin Epitope is Unique to M. pneumoniasP1

By comparing the sequence of the P1-7 probe to the known DNA sequence ofthe complete P1 gene, it was determined that thd P1 molecule containedonly one copy of the thirteen amino acid epitope described above.However, it was of interest to determine whether or not this epitope wasunique to M. pneumonias. Therefore, the following experiment wasperformed.

Mycoplasma DNA was digested with different restriction enzymes (BamHI,EcoRI, Hind III, Pst I, Sac I, Sma I) and fractionated by agarose gelelectrophoresis, essentially as described in Example ID above. However,the DNA insert from clone P1-7 was used as a hybridization probe.Hybridization was carried out at 68° C. overnight according to Maniatis,et al., MOLECULAR CLONING, Cold Spring Harbor Laboratory, Cold SpringHarbor, N.Y. (1982), pp. 382-289. The results of this procedure, shownin FIG. 10, clearly demonstrate that the cytadherence related epitope ofclone P1-7 occurs only once in the M. pneumonias genome.

D. Analysis of M. pneumonias P1 Cytadhesin Peptides

The following studies were undertaken to further characterize thecytadhesin polypeptides produced by the recombinant lambda gt11bacteriophage.

It will be appreciated by those familiar with the lambda gt11 fusionsystem, that the site used for insertion of foreign DNA is a uniqueEcoRI cleavage site located within the lacZ gene, 53 base pairs upstreamfrom the beta-galactosidase translation termination codon. Because thesite of insertion for foreign DNA in lambda gt11 is within thestructural gene for beta-galactosidase, foreign DNA sequences in thisvector have the potential to be expressed as fusion proteins withbeta-galactosidase. The position within the beta-galactosidase genechosen for fusion with foreign DNA sequences, corresponds to a regionnear the carboxy terminus of the beta-galactosidase protein.

Fusion proteins expressed by the recombinant clones of the presentinvention were analyzed by Western blotting. This procedure wasperformed essentially as follows. M. pneumonias protein (2 mg) wassuspended in 0.3 ml of PBS, and an equal volume of 100 mM Tris (pH 6.8)-2% S.D.S. -20% glycerol -2% 2-mercaptoethanol-0.02% bromophenol bluebuffer (SP buffer) was added. Samples were boiled for 5 minutes.Recombinant fusion proteins were harvested from plate lysates ofindividual clones by scraping soft agarose overlays from the plates,passing them through a 22 gauge needle into a Corex tube and eludingwith 4 m of SM buffer for two hours at 4° C. The agarose was pelleted bycentrifugation at 10,000×g for 15 minutes at 4° C. prior totrichloracetic acid precipitation of the supernatant by the addition ofcold trichloracetic acid, for a final concentration of 10%. Samples wereincubated at 4° C. overnight prior to centrifugation at 10,000×g for 20minutes at 4° C. Supernatants were discarded, and pellets were washedtwice with 1 ml of PBS, suspended in 200 ul of SP buffer, andneutralized with 1 ul of 5N NAOH. Samples were boiled for 5 minutes andsolubilized proteins were electrophoresed on a 5.0% polyacrylamide gelprior to electrophoretic transfer to nitrocellulose paper (Towbin, etal., Proc. Natl. Acad. Sci., U.S.A., 76:4350-4354 (1979)).

After protein transfer, the nitrocellulose was cut into strips andreacted with a pool of the two MAbs (monoclonal antibodies) designated5B8 and 6E7. For this procedure, nitrocellulose blots were blocked in1.5% bovine serum albumin (BSA) -1.5% gelatin in TBS for 3-4 hours priorto incubation with the pooled monoclonal antibodies. The finalconcentration of the antibodies in the reaction mixture was 2 ug/ml 5B8and 1 ug/ml 6E7 in a buffer comprising TBS plus 20% FCS. Blots wereincubated with the diluted antibody preparation overnight at roomtemperature with shaking, following by three ten minute washes with TBS.Horseradish peroxidase-conjugated goat anti-mouse IgG diluted 1:2000 inTBS containing 0.75% BSA 0.75% gelatin was added to the blots andincubated with shaking for 3-4 hours at room temperature. Blots werewashed three times for ten minute periods with TBS prior to substratedevelopment.

The results of this procedure, shown in FIG. 11, show the representativeclones produced fusion proteins larger than the control lambda gt11beta-galactosidase protein. However, except for clone P1-7, the size ofeach fusion protein was much smaller than that predicted from the sizeof the corresponding recombinant DNA insert. This finding may beexplained as resulting from early termination of the cytadhesin peptidedue to the presence of the TGA codon at position 4556. The presentinventors have discovered that M. pneumonias utilizes this codon fortryptophan, while E. coli reads UGA as stop signal. Therefore, when E.coli is used as a host for a vector containing the recombinantPneumonias insert, a prematurely truncated polypeptide may be produced.

E. Cytadhesin Peptides Can Be Used for Serodiagnosis of M. pneumoniasInfection

Studies have shown that adhesin P1 is highly immunogenic (Hu, et al.,Science, 216:313-315 (1982)) and patients infected with M. pneumoniasexhibit neutralizing antibodies to the P1 adhesin (Leith, et al., J.Exp. Med., 157:502-516 (1983)). Since the isolated clones express P1cytadhesin peptides, these clones were analyzed for reactivity with seraof patients with early and late stages of M. pneumonias infection.Normal human sera was used as a control. These experiments wereperformed by the immunophage blot method. Briefly, this procedure wasperformed as follows. Individual recombinant phages were dotted on alawn of E. coli Y1090. The plates were incubated at 42° C. for 3-5hours. Then a nitrocellulose filter (HAHY, M) previously saturated with10 mM IPTG was overlayed on individual plates and incubation continuedat 37° C. overnight. Filters were removed and reacted with sera from M.pneumoniae infected patients or normal human controls essentially asdescribed in FIGS. 12-I, 12-II, 12-III using horseradishperoxidase-conjugated goat anti-human immunoglobulin, and4-chloro-1-naphthol to develop the immunoblots.

The results of this procedure, shown in FIG. 12, indicated that fusionproteins produced by all ten anti-P1 MAb reactive clones also reactedwith acute and convalescent sera of M. pneumonias infected patients butdid not react with normal human serum. Therefore, the cytadherencerelated P1 peptides or fusion proteins described herein may be used forserodiagnosis of patients infected with M. pneumonias.

F. Preparation of Recombinant Antigens from the Lambda gt11 RecombinantClones

It is often useful to have preparative amounts of polypeptides specifiedby a cloned piece of DNA. For some purposes, for instance,radioimmunoassays, it is sufficient to have a crude E. coli lysatecontaining an antigen specified by the cloned DNA of interest. Thisprophetic example illustrates how a crude lysate containing a cytadhesinpeptide fusion protein can be prepared by expressing a lambda gt11recombinant as a lysogen in E. coli 1089 (E. coli Delta lac U169proA+Delta lon ara D139 strA hsl A150 (chr::Tn10] (p MC9)). Therecombinant fusion protein would be produced by lysogenizing Y1089 withthe lambda gt11 clone of interest. The lysogen would be grown to highcell density, lacZ-directed fusion protein production induced by theaddition of IPTG to the medium, and the cells harvested and lysed.

More specifically, the Y1089 cells would be grown to saturation in LBmedium (pH 7.5/0.2% maltose) at 37° C. and then infected with theselected lambda gt11 recombinant phage (preferably P1-7) at amultiplicity of approximately 5 for 20 minutes at 32° C. in LB medium(pH 7.5) supplemented with 10 mm MgCl₂. The cells would then be platedon LB plate at a density of approximately 200/plate and incubated at 32°C. At this temperature, the temperature sensitive phage repressor isfunctional. Single colonies would be tested for temperature sensitivityat 42° C. by spotting cells from single colonies using steriletoothpicks onto two LB plates. The first plate would be incubated at 42°C. and the second at 32° C. Clones growing at 32° C. but not at 42° C.are assumed to be lysogens. Lysogens should arise at a frequency between10% and 70%.

The crude lysate would then be prepared from the lambda gill recombinantlysogen by incubating 100 ml of LB medium with a single colony of theY1089 recombinant lysogen at 32° C. with aeration. When the culture hasgrown to an optical density of 0.5 measured at 600 mm, the temperatureof the culture would be increased to 42°-54° C. as rapidly as possibleand the culture incubated at the elevated temperature for 20 minuteswith good aeration. IPTG would be added to 10 mM and the culture isincubated at 37°-38° C. for approximately one hour. At this stage, theY1089 lysogen will sometimes lyse, even though the Y1089 does notsuppress the mutation, causing defective lyses (S100) in lambda gt11.The reason for this is that the S100 amber mutation is leaky and foreignproteins accumulating in E. coli often render it susceptible to lysis.Therefore, the longest incubation time achievable at 37°-38° C. withoutlysis occurring should be determined for each individual recombinantlysogen. After incubation, the cells would be harvested in a Beckman J.A.-ten rotor at 5,000 r.p.m. for 5 minutes 27°-37° C. The cells wouldthen be rapidly resuspended in 1/20 to 1/50 of the original culturevolume in a buffer suitable for protein and the resuspended cells arerapidly frozen in liquid nitrogen. When the frozen cells are thawed,essentially complete lysis of the induced lysogen results.

If crude antigen is required, the crude lysate described above could beused. However, if pure antigen is needed, the beta-galactosidase fusionprotein would be purified by any of a number of methods known to thoseof skill in the art. The most rapid method of purification takesadvantage of the size of the beta-galactosidase fusion protein(approximately 114 kDa). Since only a few proteins in E. coli are largerthan beta-galactosidase, the fusion protein is often resolved from otherproteins on SDS-polyacrylamide gels. Preparative gels could be used toisolate large quantities of denatured protein. If pure antigen in nativeform is required, then the fusion protein could be prepared by classicalcolumn chromatography.

G. Synthesis of a Synthetic Peptide Containing the Amino Acid CytadhesinEpitopes

The following prophetic example describes methods for preparingsynthetic polypeptides containing cytadhesin epitopes. M. pneumonias P1polypeptides could be prepared by any of a number of methods known tothose of skill in the art. These methods include but are not limited tosolid and liquid phase chemical synthesis and biological in vitrosynthesis. For example, see Marglin and Merrifield, Annu. Red. Biochem.,39:841-866 (1970); Merrifield, et al., Biochemistry, 21:5020-5031(1982); Pelham and Jackson, Eru. J. Biochem., 67:247-256 (1976); andShinnick, et al., Ann. Rev. Microbiol., 37:425-446 (1983), allincorporated herein by reference. Of course, where an MRNA translationsystem is used, e.g., reticulocyte lysate system, it is important toprepare mRNA from the DNA clones of the present invention. Techniquesfor preparing the MRNA from DNA clones are known in the art. Forexample, see those described in Chapter 2, 1987 Promega BiologicalResearch Products Catalogue, obtainable from Promega Biolabs, 2800 SouthFish Hatchery Road, Madison, Wis. 53711-5305 and incorporated herein byreference. A preferred method for preparing a synthetic peptide may befound in U.S. Pat. No. 4,493,795 issued to Nestor, Jr., et al., andincorporated herein by reference. A second method is found in U.S. Pat.No. 4,474,757, issued to Arman, et al., and also incorporated herein byreference.

H. Preparation of M. pneumonias Compositions For Use As M. pneumoniasVaccines

Of course, it is also likely that the cytadhesin peptides may beeffectively used as vaccines to prevent atypical pneumonias caused by M.pneumonias. The preparation of vaccines which contain peptide sequencesas active ingredients is generally well understood in the art, asexemplified by U.S. Pat. Nos. 4,474,757; 4,493,795; 4,608,251;4,601,903; 4,599,231; 4,599,230; 4,596,792; and 4,578,770, allincorporated herein by reference.

This prophetic example describes preparation and administration of suchvaccines. In general, immunogenic compositions suitable foradministration as vaccines could be formulated to include one or more ofthe antigenic epitopes produced by the recombinant cells of the presentinvention or synthetically prepared. The antigens could be included inoptimal amounts, for example, approximately equimolar or equi-antigenicamounts. Typically, such vaccines are prepared as injectables: either asliquid solutions or suspensions, solid forms suitable for solution in,or suspension in, liquid prior to injection may also be prepared. Thepreparation could also be emulsified. The reactive immunogenicingredient is often mixed with excipients which are pharmaceuticallyacceptable and compatible with the active ingredient. Suitableexcipients are, for example, water, saline, dextrose, glycerol, ethanol,or the like and combinations thereof. In addition, if desired, thevaccine could contain minor amounts of auxiliary substances such aswetting or emulsifying agents, pH buffering agents, or adjuvants whichenhance the effectiveness of the vaccine.

In addition, immunogenicity of cytadhesin peptides could be increased byconjugation of a carrier molecule, for example, dipalmityl lysine. (SeeHopp, Mol. Immunol., 21:13-16 (1984) incorporated herein by reference.)

The proteins or polypeptides could be formulated into the vaccine asneutral or salt forms and administered in a manner compatible with thedosage formulation, and in such amount as will be therapeuticallyeffective and immunogenic. The vaccines could be conventionallyadministered parenterally, by injection, for example, eithersubcutaneously or intramuscularly. Additional formulations which aresuitable for other modes of administration might include oral orintranasal formulations. The quantity to be administered will depend onthe subject to be treated, capacity of the immune system to synthesizeantibodies, and the degree of protection desired. Precise amounts ofactive ingredient required to be administered will depend on thejudgment of the practitioner and may be peculiar to each individual.However, suitable dosage ranges will be on the order of 1 to 100 ugactive ingredient per individual. Suitable regimes for initialadministration and booster shots will also be variable, but may betypified by an initial administration followed by subsequentinoculations or other administrations.

In many instances, it may be desirable to have multiple administrationsof the vaccine, usually not exceeding six vaccinations, more usually notexceeding four vaccinations and preferably one or more, usually at leastabout three vaccinations. The vaccinations will normally be at from twoto twelve week intervals, more usually from three to five weekintervals. Periodic boosters at intervals of 1-5 years, usually threeyears, will be desirable to maintain protective levels of theantibodies. The course of the immunization may be followed by assays forantibodies for the antigens as described below.

I. Immunoassay For M. pneumonias Antibodies

As demonstrated by Example IV E., certain of the P1 polypeptides areknown to react with antisera from patients infected with M. pneumonias.Accordingly, these polypeptides may be used as antigens in immunoassayprocedures. These assays are well known to those of skill in the art.For examples of such assays, see Nisonoff, Introduction to MolecularImmunology, 2nd Ed., Sinaues Associates, Inc., Sunderland, Mass. (1984)and U.S. Pat. No. 4,376,110, both incorporated herein by reference.

The following prophetic example is designed to illustrate suchprocedures. Generally, for detection of antibody in aqueous samples, theantigen, or antigen composition, is preferably adsorbed, or otherwiseattached, to an appropriate adsorption matrix, for example, the insidesurface of a microtiter dish well, and an aqueous suspectedantibody-containing composition contacted therewith to causeimmunocomplex formation. The matrix is then washed to removenon-specifically bound material and the amount of material which isspecifically immunocomplexed thereto determined, typically through theuse of an appropriate labeled ligand.

The cytadhesin polypeptides provided by the present invention may alsobe incorporated into a diagnostic kit. Such kits are widely used inclinical settings because they often offer greater convenience andsimplicity than other assays. A number of kits might be utilized in thepractice of the present invention, for example, a kit comprising acarrier compartmentalized to receive at least one, at least two, or atleast three or more containers and to maintain said containers enclosedconfinement.

A first container might include one or more of the M. pneumoniasantigens, or antigen-containing compositions. Alternatively, or inaddition, the kits will include antibody compositions having specificityfor one or more of the antigens. Both antibody and antigen preparationsshould preferably be provided in a suitable titrated form, with antigenconcentrations and/or antibody titers given for easy reference inquantitative applications.

The kits will also typically include an immunodetection reagent or labelfor the detection of specific immunoreaction between the providedantigen and/or antibody, as the case may be, and the diagnostic sample.Suitable detection reagents are well known in the art as exemplified byradioactive, enzymatic or otherwise chromogenic ligands, which aretypically employed in association with the antigen and/or antibody, orin association with a second antibody having specificity for the antigenor first antibody. Thus, the reaction is detected or quantified by meansof detecting or quantifying the label. Immunodetection reagents andprocesses suitable for application in connection with the novelcompositions of the present invention are generally well known in theart.

The foregoing description has been directed to particular embodiments ofthe invention in accordance with the requirements of the Patent Statutesfor the purposes of illustration and explanation. It will be apparent,however, to those skilled in this art, that many modifications andchanges in the apparatus and procedure set forth will be possiblewithout departing from the scope and spirit of the invention. It isintended that the following claims be interpreted to embrace all suchmodifications and changes.

    __________________________________________________________________________    SEQUENCE LISTING                                                              (1) GENERAL INFORMATION:                                                      (iii) NUMBER OF SEQUENCES: 10                                                 (2) INFORMATION FOR SEQ ID NO:1:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 13 Amino Acids                                                    (B) TYPE: Amino Acid                                                          (C) STRANDEDNESS: Single                                                      (D) TOPOLOGY: Linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (A) DESCRIPTION: Deduced amino acid sequence of                                polypeptide fragment no. 1                                                   (iii) HYPOTHETICAL: No                                                        (v) FRAGMENT TYPE: internal fragment                                          (vi) ORIGINAL SOURCE:                                                         (A) ORGANISM: Mycoplasma pneumoniae                                           (B) STRAIN: M129-B16                                                          (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: Gene cloned in lambda gt11 phage in E.                             coli Y1090                                                                    (ix) FEATURE:                                                                 (A) NAME/KEY: Deduced amino acid sequence of                                  polypeptide fragment no. 1                                                    (B) LOCATION: Amino Acids: 1383 to 1395                                       (D) OTHER INFORMATION: Phenotype Conferred:                                   cytadhering and virulent; Biological Activity:                                cytadherence; Functional Class: cytadhesin;                                   Binding Macromolecules: receptors unknown;                                    Subcellular Location: membrane                                                (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                       GlyIleValArgThrProLeuAlaGluLeuLeuAspGly                                       1 510                                                                         (2) INFORMATION FOR SEQ ID NO:2:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 39 Nucleotides                                                    (B) TYPE: Nucleic Acid                                                        (C) STRANDEDNESS: Single                                                      (D) TOPOLOGY: Linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (A) DESCRIPTION: Nucleic acid sequence of polypeptide                         fragment no. 1                                                                (iii) HYPOTHETICAL: No                                                        (v) FRAGMENT TYPE: internal fragment                                          (vi) ORIGINAL SOURCE:                                                         (A) ORGANISM: Mycoplasma pneumoniae                                           (B) STRAIN: M129-B16                                                          (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: Gene cloned in lambda gt11 phage in E. coli                        Y1090                                                                         (ix) FEATURE:                                                                 (A) NAME/KEY: Nucleic acid sequence of polypeptide fragemnt                   no. 1                                                                         (B) LOCATION: Nucleotide Numbers: 4147 to 4185                                 (D) OTHER INFORMATION: Phenotype Conferred: cytadhering and                  virulent; Biological Activity: cytadherence; Functional                       Class: cytadhesin; Binding Macromolecules: receptors                          unknown; Subcellular Location: membrane                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                       GGTATTGTACGCACCCCACTCGCTGAACTGTTAGATGGG4185                                   GlyIleValArgThrProLeuAlaGluLeuLeu AspGly                                      1510                                                                          (2) INFORMATION FOR SEQ ID NO:3:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 40 Amino Acids                                                    (B) TYPE: Amino Acid                                                          (D) TOPOLOGY: Linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (A) DESCRIPTION: Deduced amino acid sequence of                               polypeptide fragment no. 2                                                    (iii) HYPOTHETICAL: No                                                         (v) FRAGMENT TYPE: internal fragment                                         (vi) ORIGINAL SOURCE:                                                         (A) ORGANISM: Mycoplasma pneumoniae                                           (B) STRAIN: M129-B16                                                          (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: Gene cloned in lambda gt11 phage in E. coli                        Y1090                                                                         (ix) FEATURE:                                                                 (A) NAME/KEY: Deduced amino acid sequence of polypeptide                      fragment no. 2                                                                (B) LOCATION: Amino Acid Numbers: 1356 to 1395                                ( xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                      AsnThrAsnThrGlyAsnAspValValGlyValGlyArgLeuSerGlu                              151015                                                                        SerAsnAlaAlaLysMetAsnAspAspValAspGlyIleValArgTh r                             202530                                                                        ProLeuAlaGluLeuLeuAspGly                                                      3540                                                                          (2) INFORMATION FOR SEQ ID NO:4:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 120 Nucleotides                                                   (B) TYPE: Nucleic Acid                                                        (C) STRANDEDNESS: Single                                                      (D) TOPOLOGY: Linear                                                           (ii) MOLECULE TYPE: Genomic DNA                                              (A) DESCRIPTION: Nucleic acid sequence for polypeptide                        fragment no. 2                                                                (iii) HYPOTHETICAL: No                                                        (v) FRAGMENT TYPE: internal fragment                                          (vi) ORIGINAL SOURCE:                                                         (A) ORGANISM: Mycoplasma pneumoniae                                           (B) STRAIN: M129-B16                                                          (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: Gene cloned in lambda gt11 phage in E. coli                        Y1090                                                                         (ix) FEATURE:                                                                  (A) NAME/KEY: Nucleic acid sequence of polypeptide fragment                  no. 2                                                                         (B) LOCATION: Nucleotide Numbers: 4066 to 4185                                (ix) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                       AATACTAATACGGGGAATGATGTGGTGGGGGTTGGTCGACTTTCTGAA4113                          AsnThrAsnThrGlyAsnAspValValGlyValGlyAr gLeuSerGlu                             151015                                                                        AGCAACGCCGCAAAGATGAATGACGATGTTGATGGTATTGTACGCACC4161                          SerAsnAlaAlaLysMetAsnAspAspValAspGlyIleV alArgThr                             202530                                                                        CCACTCGCTGAACTGTTAGATGGG4185                                                  ProLeuAlaGluLeuLeuAspGly                                                      3540                                                                          (2) INFORMATION FOR SEQ ID NO:5:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 136 Amino Acids                                                    (B) TYPE: Amino Acid                                                         (D) TOPOLOGY: Linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (A) DESCRIPTION: Deduced amino acid sequence of                               polypeptide fragment no. 3                                                    (iii) HYPOTHETICAL: No                                                        (v) FRAGMENT TYPE: internal fragment                                          (vi) ORIGINAL SOURCE:                                                         (A) ORGANISM: Mycoplasma pneumoniae                                           (B) STRAIN: M129-B16                                                          (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: Gene cloned in lambda gt11 phage in E. coli                        Y1090                                                                         (ix) FEATURE:                                                                 (A) NAME/KEY: Deduced amino acid sequence of polypeptide                      fragment no. 3                                                                (B) LOCATION: Amino Acid Numbers: 1383 to 1518                                (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                       GlyIleValArgThrProLeuAlaGluLeuLeuAspGlyGluGlyGln                              15 1015                                                                       ThrAlaAspThrGlyProGlnSerValLysPheLysSerProAspGln                              202530                                                                        IleAspPheAsnArgLeuPheThrHisProValThrAs pLeuPheAsp                             354045                                                                        ProValThrMetLeuValTyrAspGlnTyrIleProLeuPheIleAsp                              505560                                                                        IleProAlaSerValAsnProLy sMetValArgLeuLysValLeuSer                             65707580                                                                      PheAspThrAsnGluGlnSerLeuGlyLeuArgLeuGluPhePheLys                              8590 95                                                                       ProAspGlnAspThrGlnProAsnAsnAsnValGlnValAsnProAsn                              100105110                                                                     AsnGlyAspPheLeuProLeuLeuThrAlaSerSerGlnGlyProGln                              115 120125                                                                    ThrLeuPheSerProPheAsnGln                                                      130135                                                                        (2) INFORMATION FOR SEQ ID NO:6:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 408 Nucleotides                                                   (B) TYPE: Nucleic Acid                                                        (C) STRANDEDNESS: Single                                                      (D) TOPOLOGY: Linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                                (A) DESCRIPTION: Nucleic acid sequence of polypeptide                        fragment no. 3                                                                (iii) HYPOTHETICAL: No                                                        (v) FRAGMENT TYPE: internal fragment                                          (vi) ORIGINAL SOURCE:                                                         (A) ORGANISM: Mycoplasma pneumoniae                                           (B) STRAIN: M129-B16                                                          (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: Gene cloned in lambda gt11 phage in E. coli                        Y1090                                                                         (ix) FEATURE:                                                                 (A) NAME/KEY: Nucleic acid sequence of polypeptide fragment                   no. 3                                                                         (B) LOCATION: Nucleotide Numbers: 4147 to 4554                                (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                       GGTATTGTACGCACCCCACTCGCTGAACTGTTAGATGGGGAAGGACAA4194                          GlyIleValArgThrProLeuAlaGluLeuLeuAspGlyGluGlyGln                              1 51015                                                                       ACAGCTGACACTGGTCCACAAAGCGTGAAGTTCAAGTCTCCTGACCAA4242                          ThrAlaAspThrGlyProGlnSerValLysPheLysSerProAspGln                              20 2530                                                                       ATTGACTTCAACCGCTTGTTTACCCACCCAGTCACCGATCTGTTTGAT4290                          IleAspPheAsnArgLeuPheThrHisProValThrAspLeuPheAsp                              3540 45                                                                       CCGGTAACTATGTTGGTGTATGACCAGTACATACCGCTGTTTATTGAT4338                          ProValThrMetLeuValTyrAspGlnTyrIleProLeuPheIleAsp                              505560                                                                        ATCCCAGCAAGTGTGAACCCTAAAATGGTTCGTTTAAAGGTCTTGAGC4386                          IleProAlaSerValAsnProLysMetValArgLeuLysValLeuSer                              6570758 0                                                                     TTTGACACCAACGAACAGAGCTTAGGTCTCCGCTTAGAGTTCTTTAAA4434                          PheAspThrAsnGluGlnSerLeuGlyLeuArgLeuGluPhePheLys                              859095                                                                        CCTGATCAAGAT ACCCAACCAAACAACAACGTTCAGGTCAATCCGAAT4482                         ProAspGlnAspThrGlnProAsnAsnAsnValGlnValAsnProAsn                              100105110                                                                     AACGGTGACTTCTTACCACTGTTAA CGGCCTCCAGTCAAGGTCCCCAA4530                         AsnGlyAspPheLeuProLeuLeuThrAlaSerSerGlnGlyProGln                              115120125                                                                     ACCTTGTTTAGTCCGTTTAACCAG4554                                                  ThrLeuPhe SerProPheAsnGln                                                     130135                                                                        (2) INFORMATION FOR SEQ ID NO:7:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 118 Amino Acids                                                   (B) TYPE: Amino Acid                                                          (D) TOPOLOGY: Linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (A) DESCRIPTION: Deduced amino acid sequence of                               polypeptide fragment no. 4                                                    (iii) HYPOTHETICAL: No                                                         (v) FRAGMENT TYPE: internal fragment                                         (vi) ORIGINAL SOURCE:                                                         (A) ORGANISM: Mycoplasma pneumoniae                                           (B) STRAIN: M129-B16                                                          (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: Gene cloned in lambda gt11 phage in E. coli                        Y1090                                                                         (ix) FEATURE:                                                                 (A) NAME/KEY: Deduced amino acid sequence of polypeptide                      fragment no. 4                                                                (B) LOCATION: Amino Acid Numbers: 1401 to 1518                                (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                       AspThrGlyProGlnSerValLysPheLysSerProAspGlnIleAsp                              151015                                                                        PheAsnArgLeuPheThrHisProValThrAspLeuPheAspProVal                              20 2530                                                                       ThrMetLeuValTyrAspGlnTyrIleProLeuPheIleAspIlePro                              354045                                                                        AlaSerValAsnProLysMetValArgLeuLysVal LeuSerPheAsp                             505560                                                                        ThrAsnGluGlnSerLeuGlyLeuArgLeuGluPhePheLysProAsp                              65707580                                                                      GlnAsp ThrGlnProAsnAsnAsnValGlnValAsnProAsnAsnGly                             859095                                                                        AspPheLeuProLeuLeuThrAlaSerSerGlnGlyProGlnThrLeu                              100105 110                                                                    PheSerProPheAsnGln                                                            115                                                                           (2) INFORMATION FOR SEQ ID NO:8:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 354 Nucleotides                                                   (B) TYPE: Nucleic Acid                                                        (C) STRANDEDNESS: Single                                                      (D) TOPOLOGY: Linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (A) DESCRIPTION: Nucleic acid sequence of polypeptide                         fragment no. 4                                                                ( iii) HYPOTHETICAL: No                                                       (v) FRAGMENT TYPE: internal fragment                                          (vi) ORIGINAL SOURCE:                                                         (A) ORGANISM: Mycoplasma pneumoniae                                           (B) STRAIN: M129-B16                                                          (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: Gene cloned in lambda gt11 phage in E. coli                        Y1090                                                                         (ix) FEATURE:                                                                 (A) NAME/KEY: Nucleic acid sequence of polypeptide fragment                   no. 4                                                                         (B) LOCATION: Nucleotide Numbers: 4201 to 4554                                (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                       GACACTGGTCCACAAAGCGTGAAGTTCAAGTCTCCTGACCAAATTGAC4248                          AspThrGlyProGlnSerValLysPheLysSerProAspGlnIleAsp                              1510 15                                                                       TTCAACCGCTTGTTTACCCACCCAGTCACCGATCTGTTTGATCCGGTA4296                          PheAsnArgLeuPheThrHisProValThrAspLeuPheAspProVal                              202530                                                                        ACT ATGTTGGTGTATGACCAGTACATACCGCTGTTTATTGATATCCCA4334                         ThrMetLeuValTyrAspGlnTyrIleProLeuPheIleAspIlePro                              354045                                                                        GCAAGTGTGAACCCTAAA ATGGTTCGTTTAAAGGTCTTGAGCTTTGAC4392                         AlaSerValAsnProLysMetValArgLeuLysValLeuSerPheAsp                              505560                                                                        ACCAACGAACAGAGCTTAGGTCTCCGCTTAGA GTTCTTTAAACCTGAT4440                         ThrAsnGluGlnSerLeuGlyLeuArgLeuGluPhePheLysProAsp                              65707580                                                                      CAAGATACCCAACCAAACAACAACGTTCAG GTCAATCCGAATAACGGT4488                         GlnAspThrGlnProAsnAsnAsnValGlnValAsnProAsnAsnGly                              859095                                                                        GACTTCTTACCACTGTTAACGGCCTCCAGTCAAGGTCCCCAAACC TTG4536                         AspPheLeuProLeuLeuThrAlaSerSerGlnGlyProGlnThrLeu                              100105110                                                                     TTTACTCCGTTTAACCAG4554                                                        PheSerProPheAsnGln                                                            115                                                                           (2) INFORMATION FOR SEQ ID NO:9:                                               (i) SEQUENCE CHARACTERISTICS:                                                (A) LENGTH: 1627 Amino Acids                                                  (B) TYPE: Amino Acid                                                          (D) TOPOLOGY: Linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (A) DESCRIPTION: Deduced amino acid sequence of P1                            protein                                                                       (iii) HYPOTHETICAL: No                                                        (vi) ORIGINAL SOURCE:                                                         (A) ORGANISM: Mycoplasma pneumoniae                                           (B) STRAIN: M129-B16                                                          (vii) IMMEDIATE SOURCE:                                                        (B) CLONE: Gene cloned in pUC19 in E. coli HB101, ATCC                       Accession Number 67560                                                        (ix) FEATURE:                                                                 (A) NAME/KEY: Amino acid sequence of P1 protein                               (B) LOCATION: Amino Acid Numbers: 1 to 1627                                   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                       MetHisGlnThrLysLysThrAlaLeuSerLysSerThrTrpIleLeu                              15 1015                                                                       IleLeuThrAlaThrAlaSerLeuAlaThrGlyLeuThrValValGly                              202530                                                                        HisPheThrSerThrThrThrThrLeuLysArgGlnGlnPhe SerTyr                             354045                                                                        ThrArgProAspGluValAlaLeuArgHisThrAsnAlaIleAsnPro                              505560                                                                        ArgLeuThrProTrpThrTyrArgAsn ThrSerPheSerSerLeuPro                             65707580                                                                      LeuThrGlyGluAsnProGlyAlaTrpAlaLeuValArgAspAsnSer                              8590 95                                                                       AlaLysGlyIleThrAlaGlySerGlySerGlnGlnThrThrTyrAsp                              100105110                                                                     ProThrArgThrGluAlaAlaLeuThrAlaSerThrThrPheAlaLeu                              115 120125                                                                    ArgArgTyrAspLeuAlaGlyArgAlaLeuTyrAspLeuAspPheSer                              130135140                                                                     LysLeuAsnProGlnThrProThrArgAspGlnThrGlyGlnIleTh r                             145150155160                                                                  PheAsnProPheGlyGlyPheGlyLeuSerGlyAlaAlaProGlnGln                              165170175                                                                     TrpAsnGluValLys AsnLysValProValGluValAlaGlnAspPro                             180185190                                                                     SerAsnProTyrArgPheAlaValLeuLeuValProArgSerValVal                              195200205                                                                     TyrTyrGluGlnLeuGlnArgGlyLeuGlyLeuProGlnGlnArgThr                              210215220                                                                     GluSerGlyGlnAsnThrSerThrThrGlyAlaMetPheGlyLeuLys                              22523 0235240                                                                 ValLysAsnAlaGluAlaAspThrAlaLysSerAsnGluLysLeuGln                              245250255                                                                     GlyAlaGluAlaThrGlySerSerThrThrSerG lySerGlyGlnSer                             260265270                                                                     ThrGlnArgGlyGlySerSerGlyAspThrLysValLysAlaLeuLys                              275280285                                                                     IleGluValLysLysLys SerAspSerGluAspAsnGlyGlnLeuGln                             290295300                                                                     LeuGluLysAsnAspLeuAlaAsnAlaProIleLysArgSerGluGlu                              305310315 320                                                                 SerGlyGlnSerValGlnLeuLysAlaAspAspPheGlyThrAlaLeu                              325330335                                                                     SerSerSerGlySerGlyGlyAsnSerAsnProGlySerProThrPro                              340 345350                                                                    TrpArgProTrpLeuAlaThrGluGlnIleHisLysAspLeuProLys                              355360365                                                                     TrpSerAlaSerIleLeuIleLeuTyrAspAlaPro TyrAlaArgAsn                             370375380                                                                     ArgThrAlaIleAspArgValAspHisLeuAspProLysAlaMetThr                              385390395400                                                                  AlaAs nTyrProProSerTrpArgThrProLysTrpAsnHisHisGly                             405410415                                                                     LeuTrpAspTrpLysAlaArgAspValLeuLeuGlnThrThrGlyPhe                              420425 430                                                                    PheAsnProArgArgHisProGluTrpPheAspGlyGlyGlnThrVal                              435440445                                                                     AlaAspAsnGlyLysThrGlyPheAspValAspAsnSerGluAsnThr                              450 455460                                                                    LysGlnGlyPheGlnLysGluAlaAspSerAspLysSerAlaProIle                              465470475480                                                                  AlaLeuProPheGluAlaTyrPhe AlaAsnIleGlyAsnLeuThrTrp                             485490495                                                                     PheGlyGlnAlaLeuLeuValPheGlyGlyAsnGlyHisValThrLys                              500505510                                                                     SerAlaH isThrAlaProLeuSerIleGlyValPheArgValArgTyr                             515520525                                                                     AsnAlaThrGlyThrSerAlaThrValThrGlyTrpProTyrAlaLeu                              530535 540                                                                    LeuPheSerGlyMetValAsnLysGlnThrAspGlyLeuLysAspLeu                              545550555560                                                                  ProPheAsnAsnAsnArgTrpPheGluTyrValProArgMet AlaVal                             565570575                                                                     AlaGlyAlaLysPheValGlyArgGluLeuValLeuAlaGlyThrIle                              580585590                                                                     ThrMetGlyAspThrAlaThrValPr oArgLeuLeuTyrAspGluLeu                             595600605                                                                     GluSerAsnLeuAsnLeuValAlaGlnGlyGlnGlyLeuLeuArgGlu                              610615620                                                                     AspLeuGln LeuPheThrProTyrGlyTrpAlaAsnArgProAspLeu                             625630635640                                                                  ProIleGlyAlaTrpSerSerSerSerSerSerSerHisAsnAlaPro                              645 650655                                                                    TyrTyrPheHisAsnAsnProAspTrpGlnAspArgProIleGlnAsn                              660665670                                                                     ValValAspAlaPheIleLysProTrpGluAspLysAsnGlyLys Asp                             675680685                                                                     AspAlaLysTyrIleTyrProTyrArgTyrSerGlyMetTrpAlaTrp                              690695700                                                                     GlnValTyrAsnTrpSerAsnLysLeuT hrAspGlnProLeuSerAla                             705710715720                                                                  AspPheValAsnGluAsnAlaTyrGlnProAsnSerLeuPheAlaAla                              7257307 35                                                                    IleLeuAsnProGluLeuLeuAlaAlaLeuProAspLysValLysTyr                              740745750                                                                     GlyLysGluAsnGluPheAlaAlaAsnGluTyrGluArgPheAsnGln                              755 760765                                                                    LysLeuThrValAlaProThrGlnGlyThrAsnTrpSerHisPheSer                              770775780                                                                     ProThrLeuSerArgPheSerThrGlyPheAsnLeuValGlySerVa l                             785790795800                                                                  LeuAspGlnValLeuAspTyrValProTrpIleGlyAsnGlyTyrArg                              805810815                                                                     TyrGlyAsnAsnHis ArgGlyValAspAspIleThrAlaProGlnThr                             820825830                                                                     SerAlaGlySerSerSerGlyIleSerThrAsnThrSerGlySerArg                              835840845                                                                     SerPheLeuProThrPheSerAsnIleGlyValGlyLeuLysAlaAsn                              850855860                                                                     ValGlnAlaThrLeuGlyGlySerGlnThrMetIleThrGlyGlySer                              86587 0875880                                                                 ProArgArgThrLeuAspGlnAlaAsnLeuGlnLeuTrpThrGlyAla                              885890895                                                                     GlyTrpArgAsnAspLysAlaSerSerGlyGlnS erAspGluAsnHis                             900905910                                                                     ThrLysPheThrSerAlaThrGlyMetAspGlnGlnGlyGlnSerGly                              915920925                                                                     ThrSerAlaGlyAsnPro AspSerLeuLysGlnAspAsnIleSerLys                             930935940                                                                     SerGlyAspSerLeuThrThrGlnAspGlyAsnAlaIleAspGlnGln                              945950955 960                                                                 GluAlaThrAsnTyrThrAsnLeuProProAsnLeuThrProThrAla                              965970975                                                                     AspTrpProAsnAlaLeuSerPheThrAsnLysAsnAsnAlaGlnArg                              980 985990                                                                    AlaGlnLeuPheLeuArgGlyLeuLeuGlySerIleProValLeuVal                              99510001005                                                                   AsnArgSerGlySerAspSerAsnLysPheGlnAla ThrAspGlnLys                             101010151020                                                                  TrpSerTyrThrAspLeuHisSerAspGlnThrLysLeuAsnLeuPro                              1025103010351040                                                              Ala TyrGlyGluValAsnGlyLeuLeuAsnProAlaLeuValGluThr                             104510501055                                                                  TyrPheGlyAsnThrArgAlaGlyGlySerGlySerAsnThrThrSer                              10601065 1070                                                                 SerProGlyIleGlyPheLysIleProGluGlnAsnAsnAspSerLys                              107510801085                                                                  AlaThrLeuIleThrProGlyLeuAlaTrpThrProGlnAspValGly                              109 010951100                                                                 AsnLeuValValSerGlyThrThrValSerPheGlnLeuGlyGlyTrp                              1105111011151120                                                              LeuValThrPheThrAsp PheValLysProArgAlaGlyTyrLeuGly                             112511301135                                                                  LeuGlnLeuThrGlyLeuAspAlaSerAspAlaThrGlnArgAlaLeu                              114011451150                                                                   IleTrpAlaProArgProTrpAlaAlaPheArgGlySerTrpValAsn                             115511601165                                                                  ArgLeuGlyArgValGluSerValTrpAspLeuLysGlyValTrpAla                              1170117 51180                                                                 AspGlnAlaGlnSerAspSerGlnGlySerThrThrThrAlaThrArg                              1185119011951200                                                              AsnAlaLeuProGluHisProAsnAlaLeuAla PheGlnValSerVal                             120512101215                                                                  ValGluAlaSerAlaTyrLysProAsnThrSerSerGlyGlnThrGln                              122012251230                                                                  SerThrAsnSerSer ProTyrLeuHisLeuValLysProLysLysVal                             123512401245                                                                  ThrGlnSerAspLysLeuAspAspAspLeuLysAsnLeuLeuAspPro                              12501255126 0                                                                 AsnGlnValArgThrLysLeuArgGlnSerPheGlyThrAspHisSer                              1265127012751280                                                              ThrGlnProGlnProGlnSerLeuLysThrThrThrProValPheGly                               128512901295                                                                 ThrSerSerGlyAsnLeuSerSerValLeuSerGlyGlyGlyAlaGly                              130013051310                                                                  GlyGlySerSerGlySerGlyGlnSerGly ValAspLeuSerProVal                             131513201325                                                                  GluLysValSerGlyTrpLeuValGlyGlnLeuProSerThrSerAsp                              133013351340                                                                  GlyAsnThrSer SerThrAsnAsnLeuAlaProAsnThrAsnThrGly                             1345135013551360                                                              AsnAspValValGlyValGlyArgLeuSerGluSerAsnAlaAlaLys                              1365 13701375                                                                 MetAsnAspAspValAspGlyIleValArgThrProLeuAlaGluLeu                              138013851390                                                                  LeuAspGlyGluGlyGlnThrAlaAspThrGlyProGlnSerVal Lys                             139514001405                                                                  PheLysSerProAspGlnIleAspPheAsnArgLeuPheThrHisPro                              141014151420                                                                  ValThrAspLeuPheAspProValThr MetLeuValTyrAspGlnTyr                             1425143014351440                                                              IleProLeuPheIleAspIleProAlaSerValAsnProLysMetVal                              14451450 1455                                                                 ArgLeuLysValLeuSerPheAspThrAsnGluGlnSerLeuGlyLeu                              146014651470                                                                  ArgLeuGluPhePheLysProAspGlnAspThrGlnProAsnAsnAsn                              1475 14801485                                                                 ValGlnValAsnProAsnAsnGlyAspPheLeuProLeuLeuThrAla                              149014951500                                                                  SerSerGlnGlyProGlnThrLeuPheSerProPheAsnGln TrpPro                             1505151015151520                                                              AspTyrValLeuProLeuAlaIleThrValProIleValValIleVal                              152515301535                                                                  LeuSerVal ThrLeuGlyLeuAlaIleGlyIleProMetHisLysAsn                             154015451550                                                                  LysGlnAlaLeuLysAlaGlyPheAlaLeuSerAsnGlnLysValAsp                              15551560 1565                                                                 ValLeuThrLysAlaValGlySerValPheLysGluIleIleAsnArg                              157015751580                                                                  ThrGlyIleSerGlnAlaProLysArgLeuLysGlnThrSerAlaAla                              1585 159015951600                                                             LysProGlyAlaProArgProProValProProLysProGlyAlaPro                              160516101615                                                                  LysProProValGlnProProLys LysProAla                                            16201625                                                                      (2) INFORMATION FOR SEQ ID NO:10:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 4884 Nucleotides                                                  (B) TYPE: Nucleic Acid                                                        (C) STRANDEDNESS: Single                                                      (D) TOPOLOGY: Linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (A) DESCRIPTION: Nucleic Acid Sequence of P1 Protein                          (iii) HYPOTHETICAL: No                                                        (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: Mycoplasma pneumoniae                                          (B) STRAIN: M129-B16                                                          (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: Gene cloned in pUC19 in E. coli HB101, ATCC                        Accession Number 67560                                                        (ix) FEATURE:                                                                 (A) NAME/KEY: Nucleic Acid Sequence of P1 Protein                             (B) LOCATION: Nucleotide Numbers: 1 to 4884                                   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                      ATGCACCAAACCAAAAAAACTGCCTTGTC CAAGTCCACTTGGATTCTC48                           MetHisGlnThrLysLysThrAlaLeuSerLysSerThrTrpIleLeu                              151015                                                                        ATCCTCACCGCCACCGCCTCCCTCGCGACG GGACTCACCGTAGTGGGA96                           IleLeuThrAlaThrAlaSerLeuAlaThrGlyLeuThrValValGly                              202530                                                                        CACTTCACAAGTACCACCACGACGCTCAAGCGCCAGCAATTTAG CTAC144                          HisPheThrSerThrThrThrThrLeuLysArgGlnGlnPheSerTyr                              354045                                                                        ACCCGCCCTGACGAGGTCGCGCTGCGCCACACCAATGCCATCAACCCG192                           Thr ArgProAspGluValAlaLeuArgHisThrAsnAlaIleAsnPro                             505560                                                                        CGCTTAACCCCGTGAACGTATCGTAACACGAGCTTTTCCTCCCTCCCC240                           ArgLeuThrProTrpTh rTyrArgAsnThrSerPheSerSerLeuPro                             65707580                                                                      CTCACGGGTGAAAATCCCGGGGCGTGGGCCTTAGTGCGCGACAACAGC288                           LeuThrGlyGluAsn ProGlyAlaTrpAlaLeuValArgAspAsnSer                             859095                                                                        GCTAAGGGCATCACTGCCGGCAGTGGCAGTCAACAAACCACGTATGAT336                           AlaLysGlyIleThrAlaGlySerGlyS erGlnGlnThrThrTyrAsp                             100105110                                                                     CCCACCCGAACCGAAGCGGCTTTGACCGCATCAACCACCTTTGCGTTA384                           ProThrArgThrGluAlaAlaLeuThrAlaSerThrThrPh eAlaLeu                             115120125                                                                     CGCCGGTATGACCTCGCCGGGCGCGCCTTATACGACCTCGATTTTTCG432                           ArgArgTyrAspLeuAlaGlyArgAlaLeuTyrAspLeuAspPheSer                              130 135140                                                                    AAGTTAAACCCGCAAACGCCCACGCGCGACCAAACCGGGCAGATCACC480                           LysLeuAsnProGlnThrProThrArgAspGlnThrGlyGlnIleThr                              145150 155160                                                                 TTTAACCCCTTTGGCGGCTTTGGTTTGAGTGGGGCTGCACCCCAACAG528                           PheAsnProPheGlyGlyPheGlyLeuSerGlyAlaAlaProGlnGln                              165 170175                                                                    TGAAACGAGGTCAAAAACAAGGTCCCCGTCGAGGTGGCGCAAGACCCC576                           TrpAsnGluValLysAsnLysValProValGluValAlaGlnAspPro                              180185 190                                                                    TCCAATCCCTACCGGTTTGCCGTTTTACTCGTGCCGCGCAGCGTGGTG624                           SerAsnProTyrArgPheAlaValLeuLeuValProArgSerValVal                              195200205                                                                     TACTAT GAGCAGTTGCAAAGGGGGTTGGGCTTACCACAGCAGCGAACC672                          TyrTyrGluGlnLeuGlnArgGlyLeuGlyLeuProGlnGlnArgThr                              210215220                                                                     GAGAGTGGTCAAAATACTT CCACCACCGGGGCAATGTTTGGCTTGAAG720                          GluSerGlyGlnAsnThrSerThrThrGlyAlaMetPheGlyLeuLys                              225230235240                                                                  GTGAAGAACGCCGAGGC GGACACCGCGAAGAGCAATGAAAAACTCCAG768                          ValLysAsnAlaGluAlaAspThrAlaLysSerAsnGluLysLeuGln                              245250255                                                                     GGCGCTGAGGCCACTGGTTCTTCAACCACA TCTGGATCTGGCCAATCC816                          GlyAlaGluAlaThrGlySerSerThrThrSerGlySerGlyGlnSer                              260265270                                                                     ACCCAACGTGGGGGTTCGTCAGGGGACACCAAAGTCAAGGCT TTAAAA864                          ThrGlnArgGlyGlySerSerGlyAspThrLysValLysAlaLeuLys                              275280285                                                                     ATAGAGGTGAAAAAGAAATCGGACTCGGAGGACAATGGTCAGCTGCAG912                           I leGluValLysLysLysSerAspSerGluAspAsnGlyGlnLeuGln                             290295300                                                                     TTAGAAAAAAATGATCTCGCCAACGCTCCCATTAAGCGGAGCGAGGAG960                           LeuGluLysAsnAs pLeuAlaAsnAlaProIleLysArgSerGluGlu                             305310315320                                                                  TCGGGTCAGTCCGTCCAACTCAAGGCGGACGATTTTGGTACTGCCCTT1008                          SerGlyGlnSer ValGlnLeuLysAlaAspAspPheGlyThrAlaLeu                             325330335                                                                     TCCAGTTCGGGATCAGGCGGCAACTCCAATCCCGGTTCCCCCACCCCC1056                          SerSerSerGlySerGlyGlyAsn SerAsnProGlySerProThrPro                             340345350                                                                     TGAAGGCCGTGGCTTGCGACTGAGCAAATTCACAAGGACCTCCCCAAA1104                          TrpArgProTrpLeuAlaThrGluGlnIleHisLysA spLeuProLys                             355360365                                                                     TGATCCGCCTCGATCCTGATTCTGTACGATGCGCCTTATGCGCGCAAC1152                          TrpSerAlaSerIleLeuIleLeuTyrAspAlaProTyrAlaArgAsn                              3 70375380                                                                    CGTACCGCCATTGACCGCGTTGATCACTTGGATCCCAAGGCCATGACC1200                          ArgThrAlaIleAspArgValAspHisLeuAspProLysAlaMetThr                              385 390395400                                                                 GCGAACTATCCGCCCAGTTGAAGAACGCCCAAGTGAAACCACCACGGT1248                          AlaAsnTyrProProSerTrpArgThrProLysTrpAsnHisHisGly                              405 410415                                                                    TTGTGGGACTGAAAGGCGCGCGATGTTTTGCTCCAAACCACCGGGTTC1296                          LeuTrpAspTrpLysAlaArgAspValLeuLeuGlnThrThrGlyPhe                              420425 430                                                                    TTCAACCCGCGCCGCCACCCCGAGTGGTTTGATGGCGGGCAGACGGTC1344                          PheAsnProArgArgHisProGluTrpPheAspGlyGlyGlnThrVal                              435440445                                                                     GCG GATAACGAAAAGACCGGGTTTGATGTGGATAACTCTGAAAACACC1392                         AlaAspAsnGlyLysThrGlyPheAspValAspAsnSerGluAsnThr                              450455460                                                                     AAGCAGGGCTTTCAA AAGGAAGCTGACTCCGACAAGTCGGCCCCGATC1440                         LysGlnGlyPheGlnLysGluAlaAspSerAspLysSerAlaProIle                              465470475480                                                                  GCCCTCCCGTTTG AAGCGTACTTCGCCAACATTGGCAACCTCACCTGG1488                         AlaLeuProPheGluAlaTyrPheAlaAsnIleGlyAsnLeuThrTrp                              485490495                                                                     TTCGGGCAAGCGCTTTTGGTGTTTGG TGGCAATGGCCATGTTACCAAG1536                         PheGlyGlnAlaLeuLeuValPheGlyGlyAsnGlyHisValThrLys                              500505510                                                                     TCGGCCCACACCGCGCCTTTGAGTATAGGTGTCTTTAGG GTGCGCTAT1584                         SerAlaHisThrAlaProLeuSerIleGlyValPheArgValArgTyr                              515520525                                                                     AATGCAACTGGTACCAGTGCTACTGTAACTGGTTGACCATATGCCTTA16 32                         AsnAlaThrGlyThrSerAlaThrValThrGlyTrpProTyrAlaLeu                              530535540                                                                     CTGTTCTCAGGCATGGTCAACAAACAAACTGACGGGTTAAAGGATCTA1680                          LeuPheSerG lyMetValAsnLysGlnThrAspGlyLeuLysAspLeu                             545550555560                                                                  CCCTTTAACAATAACCGCTGGTTTGAATATGTACCACGGATGGCAGTT1728                          ProPheAs nAsnAsnArgTrpPheGluTyrValProArgMetAlaVal                             565570575                                                                     GCTGGCGCTAAGTTCGTTGGTAGGGAACTCGTTTTAGCGGGTACCATT1776                          AlaGlyAlaLysPheValGly ArgGluLeuValLeuAlaGlyThrIle                             580585590                                                                     ACCATGGGTGATACCGCTACCGTACCTCGCTTACTGTACGATGAACTT1824                          ThrMetGlyAspThrAlaThrValProArgLeu LeuTyrAspGluLeu                             595600605                                                                     GAAAGCAACCTGAACTTAGTAGCGCAAGGCCAAGGTCTTTTACGCGAA1872                          GluSerAsnLeuAsnLeuValAlaGlnGlyGlnGlyLeuLeuArgG lu                             610615620                                                                     GACTTGCAACTCTTCACACCCTACGGATGAGCCAATCGTCCGGATTTA1920                          AspLeuGlnLeuPheThrProTyrGlyTrpAlaAsnArgProAspLeu                              625 630635640                                                                 CCAATCGGGGCTTGAAGTAGTAGTAGTAGTAGTAGTCACAACGCACCC1968                          ProIleGlyAlaTrpSerSerSerSerSerSerSerHisAsnAlaPro                              645 650655                                                                    TACTACTTCCACAATAACCCCGATTGACAAGACCGTCCAATCCAAAAT2016                          TyrTyrPheHisAsnAsnProAspTrpGlnAspArgProIleGlnAsn                              660665 670                                                                    GTGGTTGATGCCTTTATTAAGCCCTGAGAGGACAAGAACGGTAAGGAT2064                          ValValAspAlaPheIleLysProTrpGluAspLysAsnGlyLysAsp                              675680685                                                                     GATGCCAAATACATCTACCCTTACCGTTACAGTGGCATGTGAGCTTGA2112                          AspAlaLysTyrIleTyrProTyrArgTyrSerGlyMetTrpAlaTrp                              690695700                                                                     CAGGTATACAAC TGGTCCAATAAGCTCACTGACCAACCATTAAGTGCT2160                         GlnValTyrAsnTrpSerAsnLysLeuThrAspGlnProLeuSerAla                              705710715720                                                                  GACTTTGTC AATGAGAATGCTTACCAACCAAACTCCTTGTTTGCTGCT2208                         AspPheValAsnGluAsnAlaTyrGlnProAsnSerLeuPheAlaAla                              725730735                                                                     ATTCTCAATCCGGAATTGTTAG CAGCTCTTCCCGACAAGGTTAAATAC2256                         IleLeuAsnProGluLeuLeuAlaAlaLeuProAspLysValLysTyr                              740745750                                                                     GGTAAGGAAAACGAGTTTGCTGCTAACGAGTACGA GCGCTTTAACCAG2304                         GlyLysGluAsnGluPheAlaAlaAsnGluTyrGluArgPheAsnGln                              755760765                                                                     AAGTTAACGGTAGCTCCTACCCAAGGAACAAACTGATCCCACTTCTCC 2352                         LysLeuThrValAlaProThrGlnGlyThrAsnTrpSerHisPheSer                              770775780                                                                     CCCACGCTTTCCCGTTTCTCCACCGGGTTCAACCTTGTGGGGTCGGTG2400                          ProThr LeuSerArgPheSerThrGlyPheAsnLeuValGlySerVal                             785790795800                                                                  CTCGACCAGGTGTTGGATTATGTGCCCTGGATTGGGAATGGGTACAGG2448                          LeuA spGlnValLeuAspTyrValProTrpIleGlyAsnGlyTyrArg                             805810815                                                                     TATGGCAATAACCACCGGGGCGTGGATGATATAACCGCGCCTCAAACC2496                          TyrGlyAsnAsnHisAr gGlyValAspAspIleThrAlaProGlnThr                             820825830                                                                     AGCGCGGGGTCGTCCAGCGGAATTAGTACGAACACAAGTGGTTCGCGT2544                          SerAlaGlySerSerSerGlyIleSerThr AsnThrSerGlySerArg                             835840845                                                                     TCCTTTCTCCCGACGTTTTCCAACATCGGCGTCGGCCTCAAAGCGAAT2592                          SerPheLeuProThrPheSerAsnIleGlyValGlyLeuLys AlaAsn                             850855860                                                                     GTCCAAGCCACCCTCGGGGGCAGTCAGACGATGATTACAGGCGGTTCG2640                          ValGlnAlaThrLeuGlyGlySerGlnThrMetIleThrGlyGlySer                              865 870875880                                                                 CCTCGAAGAACCCTCGACCAAGCCAACCTCCAGCTCTGAACGGGGGCG2688                          ProArgArgThrLeuAspGlnAlaAsnLeuGlnLeuTrpThrGlyAla                              885 890895                                                                    GGGTGAAGGAATGATAAGGCTTCAAGTGGACAAAGTGACGAAAACCAC2736                          GlyTrpArgAsnAspLysAlaSerSerGlyGlnSerAspGluAsnHis                              90090 5910                                                                    ACCAAGTTCACGAGCGCTACGGGGATGGACCAGCAGGGACAATCAGGT2784                          ThrLysPheThrSerAlaThrGlyMetAspGlnGlnGlyGlnSerGly                              915920 925                                                                    ACCTCCGCGGGGAATCCCGACTCGTTAAAGCAGGATAATATTAGTAAG2832                          ThrSerAlaGlyAsnProAspSerLeuLysGlnAspAsnIleSerLys                              930935940                                                                     AGTGGGGA TAGTTTAACCACGCAGGACGGCAATGCGATCGATCAACAA2880                         SerGlyAspSerLeuThrThrGlnAspGlyAsnAlaIleAspGlnGln                              945950955960                                                                  GAGGCC ACCAACTACACCAACCTCCCCCCCAACCTCACCCCCACCGCT2928                         GluAlaThrAsnTyrThrAsnLeuProProAsnLeuThrProThrAla                              965970975                                                                     GATTGACCGAACGCGCTG TCATTCACCAACAAGAACAACGCGCAGCGC2976                         AspTrpProAsnAlaLeuSerPheThrAsnLysAsnAsnAlaGlnArg                              980985990                                                                     GCCCAGCTCTTCCTCCGCGGCTTGTTGGGCA GCATCCCGGTGTTGGTG3024                         AlaGlnLeuPheLeuArgGlyLeuLeuGlySerIleProValLeuVal                              99510001005                                                                   AATCGAAGTGGGTCCGATTCCAACAAATTCCAAGCCACCGACC AAAAA3072                         AsnArgSerGlySerAspSerAsnLysPheGlnAlaThrAspGlnLys                              101010151020                                                                  TGGTCCTACACCGACTTACATTCGGACCAAACCAAACTGAACCTCCCC3120                          T rpSerTyrThrAspLeuHisSerAspGlnThrLysLeuAsnLeuPro                             1025103010351040                                                              GCTTACGGTGAGGTGAATGGGTTGTTGAATCCGGCGTTGGTGGAAACC316 8                         AlaTyrGlyGluValAsnGlyLeuLeuAsnProAlaLeuValGluThr                              104510501055                                                                  TATTTTGGGAACACGCGAGCGGGTGGTTCGGGGTCCAACACGACCAGT3216                          TyrPheGlyA snThrArgAlaGlyGlySerGlySerAsnThrThrSer                             106010651070                                                                  TCACCCGGTATCGGTTTTAAAATTCCCGAACAAAATAATGATTCCAAA3264                          SerProGlyIleGlyPheLysI leProGluGlnAsnAsnAspSerLys                             107510801085                                                                  GCCACCCTGATCACCCCCGGGTTGGCTTGAACGCCCCAGGACGTCGGT3312                          AlaThrLeuIleThrProGlyLeuAlaTrpThrP roGlnAspValGly                             109010951100                                                                  AACCTCGTTGTCAGTGGCACCACGGTGAGCTTCCAGCTCGGCGGGTGG3360                          AsnLeuValValSerGlyThrThrValSerPheGlnLeuGlyGlyT rp                             1105111011151120                                                              CTGGTCACCTTCACGGACTTTGTCAAACCCCGCGCGGGTTACCTCGGT3408                          LeuValThrPheThrAspPheValLysProArgAlaGlyTyrL euGly                             112511301135                                                                  CTCCAGTTAACGGGCTTGGATGCAAGTGATGCGACGCAGCGCGCCCTC3456                          LeuGlnLeuThrGlyLeuAspAlaSerAspAlaThrGlnArgAlaLeu                              1140 11451150                                                                 ATTTGGGCCCCCCGGCCCTGAGCGGCCTTTCGTGGCAGTTGGGTCAAC3504                          IleTrpAlaProArgProTrpAlaAlaPheArgGlySerTrpValAsn                              11551160 1165                                                                 CGGTTGGGCCGCGTGGAGAGTGTGTGGGATTTGAAGGGGGTGTGGGCG3552                          ArgLeuGlyArgValGluSerValTrpAspLeuLysGlyValTrpAla                              11701175 1180                                                                 GATCAAGCTCAGTCCGACTCGCAAGGATCTACCACCACCGCAACAAGG3600                          AspGlnAlaGlnSerAspSerGlnGlySerThrThrThrAlaThrArg                              118511901195 1200                                                             AACGCCTTACCGGAGCACCCGAATGCTTTGGCCTTTCAGGTGAGTGTG3648                          AsnAlaLeuProGluHisProAsnAlaLeuAlaPheGlnValSerVal                              120512101215                                                                  GTGG AAGCGAGTGCTTACAAGCCAAACACGAGCTCCGGCCAAACCCAA3696                         ValGluAlaSerAlaTyrLysProAsnThrSerSerGlyGlnThrGln                              122012251230                                                                  TCCACTAACAGTTCCC CCTACCTGCACTTGGTGAAGCCTAAGAAAGTT3744                         SerThrAsnSerSerProTyrLeuHisLeuValLysProLysLysVal                              123512401245                                                                  ACCCAATCCGACAAGTTAGACGACGATC TTAAAAACCTGTTGGACCCC3792                         ThrGlnSerAspLysLeuAspAspAspLeuLysAsnLeuLeuAspPro                              125012551260                                                                  AACCAGGTTCGCACCAAGCTGCGCCAAAGCTTTGGTACAG ACCATTCC3840                         AsnGlnValArgThrLysLeuArgGlnSerPheGlyThrAspHisSer                              1265127012751280                                                              ACCCAGCCCCAGCCCCAATCGCTCAAAACAACGACAC CGGTATTTGGG3888                         ThrGlnProGlnProGlnSerLeuLysThrThrThrProValPheGly                              128512901295                                                                  ACGAGTAGTGGTAACCTCAGTAGTGTGCTTAGTGGTGGGGGTGCTGGA 3936                         ThrSerSerGlyAsnLeuSerSerValLeuSerGlyGlyGlyAlaGly                              130013051310                                                                  GGGGGTTCTTCAGGCTCAGGTCAATCTGGCGTGGATCTCTCCCCCGTT3984                          GlyGlyS erSerGlySerGlyGlnSerGlyValAspLeuSerProVal                             131513201325                                                                  GAAAAAGTGAGTGGGTGGCTTGTGGGGCAGTTACCAAGCACGAGTGAC4032                          GluLysValSerGlyTrpL euValGlyGlnLeuProSerThrSerAsp                             133013351340                                                                  GGAAACACCTCCTCCACCAACAACCTCGCGCCTAATACTAATACGGGG4080                          GlyAsnThrSerSerThrAsnAsnLeuAlaP roAsnThrAsnThrGly                             1345135013551360                                                              AATGATGTGGTGGGGGTTGGTCGACTTTCTGAAAGCAACGCCGCAAAG4128                          AsnAspValValGlyValGlyArgLeuS erGluSerAsnAlaAlaLys                             136513701375                                                                  ATGAATGACGATGTTGATGGTATTGTACGCACCCCACTCGCTGAACTG4176                          MetAsnAspAspValAspGlyIleValArgThrProLeuA laGluLeu                             138013851390                                                                  TTAGATGGGGAAGGACAAACAGCTGACACTGGTCCACAAAGCGTGAAG4224                          LeuAspGlyGluGlyGlnThrAlaAspThrGlyProGlnSerValLys                              1395 14001405                                                                 TTCAAGTCTCCTGACCAAATTGACTTCAACCGCTTGTTTACCCACCCA4272                          PheLysSerProAspGlnIleAspPheAsnArgLeuPheThrHisPro                              1410 14151420                                                                 GTCACCGATCTGTTTGATCCGGTAACTATGTTGGTGTATGACCAGTAC4320                          ValThrAspLeuPheAspProValThrMetLeuValTyrAspGlnTyr                              14251430 14351440                                                             ATACCGCTGTTTATTGATATCCCAGCAAGTGTGAACCCTAAAATGGTT4368                          IleProLeuPheIleAspIleProAlaSerValAsnProLysMetVal                              14451450 1455                                                                 CGTTTAAAGGTCTTGAGCTTTGACACCAACGAACAGAGCTTAGGTCTC4416                          ArgLeuLysValLeuSerPheAspThrAsnGluGlnSerLeuGlyLeu                              146014651470                                                                  C GCTTAGAGTTCTTTAAACCTGATCAAGATACCCAACCAAACAACAAC4464                         ArgLeuGluPhePheLysProAspGlnAspThrGlnProAsnAsnAsn                              147514801485                                                                  GTTCAGGTCAATC CGAATAACGGTGACTTCTTACCACTGTTAACGGCC4512                         ValGlnValAsnProAsnAsnGlyAspPheLeuProLeuLeuThrAla                              149014951500                                                                  TCCAGTCAAGGTCCCCAAACCTTGT TTAGTCCGTTTAACCAGTGACCT4560                         SerSerGlnGlyProGlnThrLeuPheSerProPheAsnGlnTrpPro                              1505151015151520                                                              GATTACGTGTTGCCGTTAGCGA TCACTGTACCTATTGTTGTGATTGTG4608                         AspTyrValLeuProLeuAlaIleThrValProIleValValIleVal                              152515301535                                                                  CTCAGTGTTACCTTAGGACTTGCCATTGGAATCC CAATGCACAAGAAC4656                         LeuSerValThrLeuGlyLeuAlaIleGlyIleProMetHisLysAsn                              154015451550                                                                  AAACAGGCCTTGAAGGCTGGGTTTGCGCTATCAAACCAAAAGGTTG AT4704                         LysGlnAlaLeuLysAlaGlyPheAlaLeuSerAsnGlnLysValAsp                              155515601565                                                                  GTGTTGACCAAAGCGGTTGGTAGTGTCTTTAAGGAAATCATTAACCGC4752                          ValL euThrLysAlaValGlySerValPheLysGluIleIleAsnArg                             157015751580                                                                  ACAGGTATCAGTCAAGCGCCAAAACGCTTGAAACAAACCAGTGCGGCT4800                          ThrGlyIleSerGlnA laProLysArgLeuLysGlnThrSerAlaAla                             1585159015951600                                                              AAACCAGGAGCACCCCGCCCACCAGTACCACCAAAGCCAGGGGCTCCT4848                          LysProGlyAlaP roArgProProValProProLysProGlyAlaPro                             160516101615                                                                  AAGCCACCAGTGCAACCACCTAAAAAACCCGCTTAG4884                                      LysProProValGlnProProLysLysProAlaEnd                                           16201625                                                                 

What is claimed is:
 1. A fragment of Mycoplasma pneumoniae P1 proteinconsisting essentially of SEQ ID No:1.
 2. A fragment of Mycoplasmapneumoniae P1 proetin consisting essentially of SEQ ID No:3.
 3. Afragment of Mycoplasma pneumoniae P1 protein consisting essentially ofSEQ ID No:5.
 4. A fragment of Mycoplasma pneumoniae P1 proteinconsisting essentially of SEQ ID No:7.
 5. A cytadhesin polypeptidefragment corresponding to the polypeptide encoded by lambda gt11 phageP1-7. P1-9 or P1-10, ATCC accession # 40386, 40385, or 40384,respectively.